METHODS AND COMPOSITIONS FOR INHIBITION OF MEMBRANE
FUSION-ASSOCIATED EVENTS, INCLUDING HIV TRANSMISSION

1. INTRODUCTION
The present invention relates, first, to DP178
(SEQ ID NO:1), a peptide, also referred to herein as
T20, corresponding to amino acids 638 to 673 of the
HIV-1LAI transmembrane protein (TM) gp41, and portions
or analogs of DP178 (SEQ ID NO:1), which exhibit
anti-membrane fusion capability, antiviral activity, such
as the ability to inhibit HIV transmission to
uninfected CD-4+ cells, or an ability to modulate
intracellular processes involving coiled-coil peptide
structures. The present invention also relates to
peptides analogous to DP107 (SEQ ID NO:25), a peptide,
which is also referred to herein as T21, corresponding
to amino acids 558 to 595 of the HIV-1LAI transmembrane
protein (TM) gp41, having amino acid sequences present
in other viruses, such as enveloped viruses, and/or
other organisms, and further relates to the uses of
such peptides. These peptides exhibit anti-membrane
fusion capability, antiviral activity, or the ability
to modulate intracellular processes involving
coiled-coil peptide structures.

The gp41 region from which DP107 is derived is
referred to herein as HR1. The gp41 region from which
DP178 is derived is referred to herein as HR2. As
discussed herein, the gp41 HR1 and HR2 regions
interact (non-covalently) with each other and/or with
T20 and T21 peptides. This interaction is required
for normal infectivity of HIV.

The present invention therefore additionally
relates to methods for identifying compounds,
including small molecule compounds, that disrupt the
interaction between DP178 and DP107, and/or between
DP107-like and DP178-like peptides. In one
embodiment, such methods relate to identification and
utilization of modified DP178, DP178-like, DP107 and
DP107-like peptides and peptide pairs that interact
with each other at a lower affinity than the affinity
exhibited by corresponding "parent" or "native"
peptides. Further, the invention relates to the use
of DP178, DP178 portions, DP107, DP107 portions and/or
analogs and other modulators, including small
molecule modulators, of DP178/DP107,
DP178-like/DP107-like, or HR1/HR2 interactions as
antifusogenic or antiviral compounds or as inhibitors
of intracellular events involving coiled-coil peptide
structures. The invention is demonstrated, first, by
way of an Example wherein DP178 (SEQ ID:1), and a
peptide whose sequence is homologous to DP178 are each
shown to be potent, non-cytotoxic inhibitors of HIV-1
transfer to uninfected CD-4+ cells. The invention is
further demonstrated by Examples wherein peptides
having structural and/or amino acid motif similarity
to DP107 and DP178 are identified in a variety of
viral and nonviral organisms, and in Examples wherein
a number of such identified peptides derived from
several different viral systems are demonstrated to
exhibit antiviral activity. The invention is still
further demonstrated by way of Examples wherein other
DP178-like and DP107-like peptides are identified that
interact with their corresponding HR1 and HR2 domains
with a lower affinity than the affinity exhibited by
the native DP178 or DP107 peptide from which they are
derived.

2. BACKGROUND OF THE INVENTION

2.1. MEMBRANE FUSION EVENTS
Membrane fusion is a ubiquitous cell biological
process (for a review, see White, J.M., 1992, Science
258:917-924). Fusion events which mediate cellular
housekeeping functions, such as endocytosis,
constitutive secretion, and recycling of membrane
components, occur continuously in all eukaryotic
cells.

Additional fusion events occur in specialized
cells. Intracellularly, for example, fusion events
are involved in such processes as occur in regulated
exocytosis of hormones, enzymes and neurotransmitters.
Intercellularly, such fusion events feature
prominently in, for example, sperm-egg fusion and
myoblast fusion.

Fusion events are also associated with disease
states. For example, fusion events are involved in
the formation of giant cells during inflammatory
reactions, the entry of all enveloped viruses into
cells, and, in the case of human immunodeficiency
virus (HIV), for example, are responsible for the
virally induced cell-cell fusion which leads to cell
death.

2.2. THE HUMAN IMMUNODEFICIENCY VIRUS

The human immunodeficiency virus (HIV) has been
implicated as the primary cause of the slowly
degenerative immune system disease termed acquired
immune deficiency syndrome (AIDS) (Barre-Sinoussi, F.
et al., 1983, Science 220:868-870; Gallo, R. et al.,
1984, Science 224:500-503). There are at least two
distinct types of HIV: HIV-1 (Barre-Sinoussi, F. et
al., 1983, Science 220:868-870; Gallo, R. et al., 1984,
Science 224:500-503) and HIV-2 (Clavel, F. et al.,
1986, Science 233:343-346; Guyader, M. et al., 1987,
Nature 326:662-669). Further, a large amount of
genetic heterogeneity exists within populations of
each of these types. Infection of human CD-4+
T-lymphocytes with an HIV virus leads to depletion of
the cell type and eventually to opportunistic
infections, neurological dysfunctions, neoplastic
growth, and ultimately death.

HIV is a member of the lentivirus family of
retroviruses (Teich, N. et al., 1984, RNA Tumor
Viruses, Weiss, R. et al., eds., CSH-Press, pp. 949-
956). Retroviruses are small enveloped viruses that
contain a diploid, single-stranded RNA genome, and
replicate via a DNA intermediate produced by a
virally-encoded reverse transcriptase, an
RNA-dependent DNA polymerase (Varmus, H., 1988, Science
240:1427-1439). Other retroviruses include, for
example, oncogenic viruses such as human T-cell
leukemia viruses (HTLV-I, -II, -III), and feline
leukemia virus.

The HIV viral particle consists of a viral core,
composed of capsid proteins, that contains the viral
RNA genome and those enzymes required for early
replicative events. Myristylated Gag protein forms an
outer viral shell around the viral core, which is, in
turn, surrounded by a lipid membrane envelope derived
from the infected cell membrane. The HIV envelope
surface glycoproteins are synthesized as a single 160
kD precursor protein which is cleaved by a cellular
protease during viral budding into two glycoproteins,
gp41 and gp120. gp41 is a transmembrane protein and
gp120 is an extracellular protein which remains
non-covalently associated with gp41, possibly in a
trimeric or multimeric form (Hammarskjold, M. and
Rekosh, D., 1989, Biochim. Biophys. Acta 989:269-280).

HIV is targeted to CD-4+ cells because the CD-4
cell surface protein acts as the cellular receptor for
the HIV-1 virus (Dalgleish, A. et al., 1984, Nature
312:763-767; Klatzmann et al., 1984, Nature 312:767-
768; Maddon et al., 1986, Cell 47:333-348). Viral
entry into cells is dependent upon gp120 binding the
cellular CD-4+ receptor molecules (McDougal, J.S. et
al., 1986, Science 231:382-385; Maddon, P.J. et al.,
1986, Cell 47:333-348) and thus explains HIV's tropism
for CD-4+ cells, while gp41 anchors the envelope
glycoprotein complex in the viral membrane.

2.3. HIV TREATMENT

HIV infection is pandemic and HIV-associated
diseases represent a major world health problem.
Although considerable effort is being put into the
successful design of effective therapeutics, currently
no curative anti-retroviral drugs against AIDS exist.
In attempts to develop such drugs, several stages of
the HIV life cycle have been considered as targets for
therapeutic intervention (Mitsuya, H. et al., 1991,
FASEB J. 5:2369-2381). For example, virally encoded
reverse transcriptase has been one focus of drug
development. A number of reverse-transcriptase-
targeted drugs, including 2',3'-dideoxynucleoside
analogs such as AZT, ddI, ddC, and d4T have been
developed which have been shown to be active against
HIV (Mitsuya, H. et al., 1991, Science 249:1533-1544).
While beneficial, these nucleoside analogs are not
curative, probably due to the rapid appearance of drug
resistant HIV mutants (Lander, B. et al., 1989,
Science 243:1731-1734). In addition, the drugs often
exhibit toxic side effects such as bone marrow
suppression, vomiting, and liver function
abnormalities.

Attempts are also being made to develop drugs
which can inhibit viral entry into the cell, the
earliest stage of HIV infection. Here, the focus has
thus far been on CD4, the cell surface receptor for
HIV. Recombinant soluble CD4, for example, has been
shown to inhibit infection of CD-4+ T-cells by some
HIV-1 strains (Smith, D.H. et al., 1987, Science
238:1704-1707). Certain primary HIV-1 isolates,
however, are relatively less sensitive to inhibition
by recombinant CD-4 (Daar, E. et al., 1990, Proc.
Natl. Acad. Sci. USA 87:6574-6579). In addition,
recombinant soluble CD-4 clinical trials have produced
inconclusive results (Schooley, R. et al., 1990, Ann.
Int. Med. 112:247-253; Kahn, J.O. et al., 1990, Ann.
Int. Med. 112:254-261; Yarchoan, R. et al., 1989,
Proc. Vth Int. Conf. on AIDS, p. 564, MCP 137).

The late stages of HIV replication, which involve
crucial virus-specific secondary processing of certain
viral proteins, have also been suggested as possible
anti-HIV drug targets. Late stage processing is
dependent on the activity of a viral protease, and
drugs are being developed which inhibit this protease
(Erickson, J., 1990, Science 249:527-533). The
clinical outcome of these candidate drugs is still in
question.

Attention is also being given to the development
of vaccines for the treatment of HIV infection. The
HIV-1 envelope proteins (gp160, gp120, gp41) have
been shown to be the major antigens for anti-HIV
antibodies present in AIDS patients (Barin, et al.,
1985, Science 228:1094-1096). Thus far, therefore,
these proteins seem to be the most promising
candidates to act as antigens for anti-HIV vaccine
development. To this end, several groups have begun
to use various portions of gp160, gp120, and/or gp41
as immunogenic targets for the host immune system.
See for example, Ivanoff, L. et al., U.S. Pat. No.
5,141,867; Saith, G. et al., WO 92/22,654; Shafferman,
A., WO 91/09,872; Formoso, C. et al., WO 90/07,119.
Clinical results concerning these candidate vaccines,
however, still remain far in the future.

Thus, although a great deal of effort is being
directed to the design and testing of anti-retroviral
drugs, a truly effective, non-toxic treatment is still
needed.



3. SUMMARY OF THE INVENTION
The present invention relates, first, to DP178, a
36-amino acid synthetic peptide, also referred to
herein as T20, corresponding to amino acids 638 to 673
of the transmembrane protein (TM) gp41 from the HIV-1
isolate LAI (HIV-1LAI), which exhibits potent
anti-HIV-1 activity. The gp41 region from which DP178 is
derived is referred to herein as HR2.
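
The residue numbering above can be checked with simple arithmetic: a peptide spanning an inclusive range of residue positions has a length of last minus first plus one. The following is a minimal illustrative sketch only; the function name is hypothetical and it assumes that both endpoint residues are counted.

```python
def span_length(first_residue: int, last_residue: int) -> int:
    """Residue count of an inclusive range (assumption: both endpoints are counted)."""
    if last_residue < first_residue:
        raise ValueError("last_residue must not precede first_residue")
    return last_residue - first_residue + 1


# DP178 (T20) is described as corresponding to gp41 amino acids 638 to 673.
dp178_length = span_length(638, 673)
print(dp178_length)
```

Under that assumption the stated positions agree with the description of DP178 as a 36-amino acid peptide.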

The invention further relates to those portions
and analogs of DP178 which also show such antiviral
activity, and/or show anti-membrane fusion capability,
or an ability to modulate intracellular processes
involving coiled-coil peptide structures. The term
"DP178 analog" refers to a peptide which contains an
amino acid sequence corresponding to the DP178 peptide
sequence present within the gp41 protein of HIV-1LAI,
but found in viruses and/or organisms other than
HIV-1LAI. Such DP178 analog peptides may, therefore,
correspond to DP178-like amino acid sequences present
in other viruses, such as, for example, enveloped
viruses, such as retroviruses other than HIV-1LAI, as
well as non-enveloped viruses. Further, such
analogous DP178 peptides may also correspond to
DP178-like amino acid sequences present in nonviral
organisms.

The invention further relates to DP107, a
peptide, which is also referred to herein as T21,
corresponding to amino acids 558-595 of the HIV-1LAI
transmembrane protein (TM) gp41. The gp41 region from
which DP107 is derived is referred to herein as HR1.
The invention also relates to those portions and
analogs of DP107 that also show antiviral
activity, and/or show anti-membrane fusion capability,
or an ability to modulate intracellular processes
involving coiled-coil peptide structures. The term
"DP107 analog" as used herein refers to a peptide
which contains an amino acid sequence corresponding to
the DP107 peptide sequence present within the gp41
protein of HIV-1LAI, but found in viruses and organisms
other than HIV-1LAI. Such DP107 analog peptides may,
therefore, correspond to DP107-like amino acid
sequences present in other viruses, such as, for
example, enveloped viruses, such as retroviruses other
than HIV-1LAI, as well as non-enveloped viruses.
Further, such DP107 analog peptides may also
correspond to DP107-like amino acid sequences present
in nonviral organisms.

Further, the peptides of the invention include
DP107 analog and DP178 analog peptides having amino
acid sequences recognized or identified by the
107x178x4, ALLMOTI5 and/or PLZIP search motifs
described herein.

The peptides of the invention may, for example,
exhibit antifusogenic activity, antiviral activity,
and/or may have the ability to modulate intracellular
processes which involve coiled-coil peptide
structures. With respect to the antiviral activity of
the peptides of the invention, such an antiviral
activity includes, but is not limited to, the
inhibition of HIV transmission to uninfected CD-4+
cells. Additionally, the antifusogenic capability,
antiviral activity or intracellular modulatory
activity of the peptides of the invention merely
requires the presence of the peptides of the
invention, and, specifically, does not require the
stimulation of a host immune response directed against
such peptides.

The peptides of the invention may be used, for
example, as inhibitors of membrane fusion-associated
events, such as, for example, the inhibition of human
and non-human retroviral, especially HIV, transmission
to uninfected cells. It is further contemplated that
the peptides of the invention may be used as
modulators of intracellular events involving
coiled-coil peptide structures.

The peptides of the invention may, alternatively,
be used to identify compounds, including small
molecule compounds, which may themselves exhibit
antifusogenic, antiviral, or intracellular modulatory
activity. For example, in one embodiment, the
peptides of the invention are used to identify other
DP178-like and/or DP107-like peptides that interact
with each other and/or with their complementary HR1 or
HR2 domains with a lower affinity than the affinity
exhibited by the "parent" or "native" DP178 or DP107
peptides from which they are derived. Such DP178-like
and DP107-like peptides, which are also part of the
present invention, may also be used, e.g., to identify
compounds, such as small molecule compounds, that
exhibit antifusogenic, antiviral, or intracellular
modulatory activity.

Additional uses include, for example, the use of 
the peptides of the invention as organism or viral 
type and/or subtype-specific diagnostic tools. 

The terms "antifusogenic" and "anti-membrane
fusion", as used herein, refer to an agent's ability
to inhibit or reduce the level of membrane fusion
events between two or more moieties relative to the
level of membrane fusion which occurs between said
moieties in the absence of the peptide. The moieties
may be, for example, cell membranes or viral
structures, such as viral envelopes or pili. The term
"antiviral", as used herein, refers to the compound's
ability to inhibit viral infection of cells, via, for
example, cell-cell fusion or free virus infection.
Such infection may involve membrane fusion, as occurs
in the case of enveloped viruses, or some other fusion
event involving a viral structure and a cellular
structure (e.g., the fusion of a viral pilus
and bacterial membrane during bacterial conjugation).
It is also contemplated that the peptides of the
invention may exhibit the ability to modulate
intracellular events involving coiled-coil peptide
structures. "Modulate", as used herein, refers to a
stimulatory or inhibitory effect on the intracellular
process of interest relative to the level or activity
of such a process in the absence of a peptide of the
invention.

Embodiments of the invention are demonstrated
below wherein an extremely low concentration of DP178
(SEQ ID:1), and very low concentrations of a DP178
homolog (SEQ ID:3) are shown to be potent inhibitors
of HIV-1 mediated CD-4+ cell-cell fusion (i.e.,
syncytial formation) and infection of CD-4+ cells by
cell-free virus. Further, it is shown that DP178 (SEQ
ID:1) is not toxic to cells, even at concentrations 3
logs higher than the inhibitory DP-178 (SEQ ID:1)
concentration.

The present invention is based, in part, on the
surprising discovery that the DP107 and DP178 domains
of the HIV gp41 protein non-covalently complex with
each other, and that their interaction is required for
the normal infectivity of the virus. This discovery
is described in the Example presented, below, in
Section 8. The invention, therefore, further relates
to methods for identifying antifusogenic, including
antiviral, compounds that disrupt the interaction
between DP107 and DP178, and/or between DP107-like and
DP178-like peptides.

Additional embodiments of the invention
(specifically, the Examples presented in Sections 9-16
and 19-25, below) are demonstrated, below, wherein
peptides, from a variety of viral and nonviral
sources, having structural and/or amino acid motif
similarity to DP107 and DP178 are identified, and
search motifs for their identification are described.
Further, Examples (in Sections 17, 18, 25-29) are
presented wherein a number of the peptides of the
invention are demonstrated to exhibit substantial
antiviral activity or activity predictive of antiviral
activity.

3.1. DEFINITIONS
Peptides are defined herein as organic compounds
comprising two or more amino acids covalently joined
by peptide bonds. Peptides may be referred to with
respect to the number of constituent amino acids,
i.e., a dipeptide contains two amino acid residues, a
tripeptide contains three, etc. Peptides containing
ten or fewer amino acids may be referred to as
oligopeptides, while those with more than ten amino
acid residues are polypeptides. Such peptides may
also include any of the modifications and additional
amino and carboxy groups as are described herein.
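
The size terminology defined above can be expressed as a simple rule on residue count. The sketch below is illustrative only: the function name is hypothetical, and it assumes the input is a plain count of amino acid residues, ignoring any terminal modifications.

```python
def peptide_size_class(residue_count: int) -> str:
    """Classify a peptide by residue count, following the definitions above.

    Two to ten residues: oligopeptide; more than ten: polypeptide.
    """
    if residue_count < 2:
        # A peptide is defined as two or more amino acids joined by peptide bonds.
        raise ValueError("a peptide comprises two or more amino acids")
    return "oligopeptide" if residue_count <= 10 else "polypeptide"


# A dipeptide and a 36-residue peptide, used purely as example counts.
print(peptide_size_class(2))
print(peptide_size_class(36))
```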

Peptide sequences defined herein are represented
by one-letter symbols for amino acid residues as
follows:

A (alanine)
R (arginine)
N (asparagine)
D (aspartic acid)
C (cysteine)
Q (glutamine)
E (glutamic acid)
G (glycine)
H (histidine)
I (isoleucine)
L (leucine)
K (lysine)
M (methionine)
F (phenylalanine)
P (proline)
S (serine)
T (threonine)
W (tryptophan)
Y (tyrosine)
V (valine)
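
For readers who handle such sequences programmatically, the one-letter code listed above can be held in a lookup table. The following is a minimal sketch, not part of the specification; the names `ONE_LETTER_CODE` and `expand_sequence` are hypothetical, and the example input is an arbitrary string rather than any sequence disclosed herein.

```python
# One-letter symbols for the twenty amino acid residues listed above.
ONE_LETTER_CODE = {
    "A": "alanine", "R": "arginine", "N": "asparagine", "D": "aspartic acid",
    "C": "cysteine", "Q": "glutamine", "E": "glutamic acid", "G": "glycine",
    "H": "histidine", "I": "isoleucine", "L": "leucine", "K": "lysine",
    "M": "methionine", "F": "phenylalanine", "P": "proline", "S": "serine",
    "T": "threonine", "W": "tryptophan", "Y": "tyrosine", "V": "valine",
}


def expand_sequence(sequence: str) -> list:
    """Return the residue names for a one-letter sequence, rejecting unknown symbols."""
    names = []
    for position, symbol in enumerate(sequence.upper(), start=1):
        if symbol not in ONE_LETTER_CODE:
            raise ValueError(f"unknown residue symbol {symbol!r} at position {position}")
        names.append(ONE_LETTER_CODE[symbol])
    return names


# Arbitrary example input, not a disclosed peptide sequence.
print(expand_sequence("AW"))
```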

4. BRIEF DESCRIPTION OF THE FIGURES

FIG. 1. Amino acid sequence of DP178 (SEQ ID:1)
derived from HIV-1LAI; DP178 homologs derived from HIV-1SF2
(DP-185; SEQ ID:3), HIV-1RF (SEQ ID:4), and HIV-1MN (SEQ
ID:5); DP178 homologs derived from amino acid
sequences of two prototypic HIV-2 isolates, namely,
HIV-2rod (SEQ ID:6) and HIV-2NIHZ (SEQ ID:7); control
peptides: DP-180 (SEQ ID:2), a peptide incorporating
the amino acid residues of DP178 in a scrambled
sequence; DP-118 (SEQ ID:10) unrelated to DP178, which
inhibits HIV-1 cell free virus infection; DP-125 (SEQ
ID:8), unrelated to DP178, also inhibits HIV-1 cell
free virus infection; DP-116 (SEQ ID:9), unrelated to
DP178, is negative for inhibition of HIV-1 infection
when tested using a cell-free virus infection assay.
Throughout the figures, the one letter amino acid code
is used.

FIG. 2. Inhibition of HIV-1 cell-free virus
infection by synthetic peptides. IC50 refers to the
concentration of peptide that inhibits RT production
from infected cells by 50% compared to the untreated
control. Control: the level of RT produced by
untreated cell cultures infected with the same level
of virus as treated cultures.
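
The IC50 definition above (the peptide concentration at which RT production is 50% of the untreated control) can be illustrated with a small interpolation sketch. This is not the method used to generate the figure data; the function names are hypothetical, the numbers in the example are invented for illustration, and log-linear interpolation between the two bracketing concentrations is an assumption.

```python
import math


def percent_inhibition(rt_treated: float, rt_control: float) -> float:
    """Inhibition of RT production relative to the untreated control, in percent."""
    return 100.0 * (1.0 - rt_treated / rt_control)


def estimate_ic50(concentrations, rt_levels, rt_control):
    """Estimate IC50 by log-linear interpolation between bracketing data points.

    Returns None when no adjacent pair of (positive) concentrations brackets
    50% inhibition, including when the lowest tested concentration already
    gives 50% inhibition or more.
    """
    points = sorted(zip(concentrations, rt_levels))
    curve = [(c, percent_inhibition(rt, rt_control)) for c, rt in points]
    for (c_lo, inh_lo), (c_hi, inh_hi) in zip(curve, curve[1:]):
        if inh_lo < 50.0 <= inh_hi:
            fraction = (50.0 - inh_lo) / (inh_hi - inh_lo)
            log_c = math.log10(c_lo) + fraction * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None


# Invented example data: concentrations in arbitrary units, RT in arbitrary units.
example_ic50 = estimate_ic50([1, 10, 100], [90, 40, 10], rt_control=100)
print(example_ic50)
```

Interpolating on a logarithmic concentration scale is a common convention for dose-response data, but other curve-fitting approaches would serve equally well here.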

FIG. 3. Inhibition of HIV-1 and HIV-2 cell-free
virus infection by the synthetic peptide DP178 (SEQ
ID:1). IC50: concentration of peptide that inhibits
RT production by 50% compared to the untreated
control. Control: level of RT produced by untreated
cell cultures infected with the same level of virus as
treated cultures.

FIG. 4A-4B. Fusion Inhibition Assays. FIG. 4A:
DP178 (SEQ ID:1) inhibition of HIV-1 prototypic
isolate-mediated syncytial formation; data represents
the number of virus-induced syncytia per well. FIG.
4B: DP-180 (SEQ ID:2) represents a scrambled control
peptide; DP-185 (SEQ ID:3) represents a DP178 homolog
derived from HIV-1SF2 isolate; Control refers to the
number of syncytia produced in the absence of
peptide.

FIG. 5. Fusion inhibition assay: HIV-1 vs.
HIV-2. Data represents the number of virus-induced
syncytia per well. ND: not done.

FIG. 6. Cytotoxicity study of DP178 (SEQ ID:1)
and DP-116 (SEQ ID:9) on CEM cells. Cell
proliferation data is shown.

FIG. 7. Schematic representation of HIV-gp41
and maltose binding protein (MBP)-gp41 fusion
proteins. DP107 and DP178 are synthetic peptides
based on the two putative helices of gp41. The letter
P in the DP107 boxes denotes an Ile to Pro mutation at
amino acid number 578. Amino acid residues are
numbered according to Meyers et al., "Human
Retroviruses and AIDS", 1991, Theoret. Biol. and
Biophys. Group, Los Alamos Natl. Lab., Los Alamos, NM.
The proteins are more fully described, below, in
Section 8.1.1.

FIG. 8. A point mutation alters the 
conformation and anti-HIV activity of M41. 

FIG. 9. Abrogation of DP178 anti-HIV activity.
Cell fusion assays were carried out in the presence of
10 nM DP178 and various concentrations of M41Δ178 or
M41PΔ178.

FIG. 10. Binding of DP178 to leucine zipper of
gp41 analyzed by FAb-D ELISA.

FIG. 11A-B. Models for a structural transition
in the HIV-1 TM protein. Two models are proposed
which indicate a structural transition from a native
oligomer to a fusogenic state following a trigger
event (possibly gp120 binding to CD4). Common
features of both models include (1) the native state
is held together by noncovalent protein-protein
interactions to form the heterodimer of gp120/41 and
other interactions, principally through gp41
interactive sites, to form homo-oligomers on the virus
surface of the gp120/41 complexes; (2) shielding of
the hydrophobic fusogenic peptide at the N-terminus
(F) in the native state; and (3) the leucine zipper
domain (DP107) exists as a homo-oligomer coiled coil
only in the fusogenic state. The major differences in
the two models include the structural state (native or
fusogenic) in which the DP107 and DP178 domains are
complexed to each other. In the first model (FIG.
11A) this interaction occurs in the native state and
in the second (FIG. 11B), it occurs during the
fusogenic state. When triggered, the fusion complex
in the model depicted in (A) is generated through
formation of coiled-coil interactions in homologous
DP107 domains resulting in an extended α-helix. This
conformational change positions the fusion peptide for
interaction with the cell membrane. In the second
model (FIG. 11B), the fusogenic complex is stabilized
by the association of the DP178 domain with the DP107
coiled-coil.

WO 01/51673 PCT/US00/35727

FIG. 12. Motif design using heptad repeat
positioning of amino acids of known coiled-coils.

FIG. 13. Motif design using proposed heptad
repeat positioning of amino acids of DP107 and DP178.

FIG. 14. Hybrid motif design crossing GCN4
and DP107.

FIG. 15. Hybrid motif design crossing GCN4
and DP178.

FIG. 16. Hybrid motif design 107x178x4,
crossing DP107 and DP178. This motif was found to be
the most consistent at identifying relevant DP107-like
and DP178-like peptide regions.

FIG. 17. Hybrid motif design crossing GCN4,
DP107, and DP178.

FIG. 18. Hybrid motif design ALLMOTI5
crossing GCN4, DP107, DP178, c-Fos, c-Jun, c-Myc, and
Flu Loop 36.

FIG. 19. PLZIP motifs designed to identify
N-terminal proline-leucine zipper motifs.

FIG. 20. Search results for HIV-1 (BRU
isolate) envelope protein gp41. Sequence search
motif designations: Spades (♠): 107x178x4; Hearts (♥):
ALLMOTI5; Clubs (♣): PLZIP; Diamonds (♦):
transmembrane region (the putative transmembrane
domains were identified using a PC/Gene program
designed to search for such peptide regions).
Asterisk (*): Lupas method. The amino acid sequences
identified by each motif are bracketed by the
respective characters. Representative sequences
chosen based on 107x178x4 searches are underlined and
in bold. DP107 and DP178 sequences are marked, and
additionally double-underlined and italicized.

FIG. 21. Search results for human
respiratory syncytial virus (RSV) strain A2 fusion
glycoprotein F1. Sequence search motif designations
are as in FIG. 20.

FIG. 22. Search results for simian
immunodeficiency virus (SIV) envelope protein gp41
(AGM3 isolate). Sequence search motif designations
are as in FIG. 20.

FIG. 23. Search results for canine
distemper virus (strain Onderstepoort) fusion
glycoprotein F1. Sequence search motif designations
are as in FIG. 20.

FIG. 24. Search results for Newcastle
disease virus (strain Australia-Victoria/32) fusion
glycoprotein F1. Sequence search motif designations
are as in FIG. 20.

FIG. 25. Search results for human
parainfluenza 3 virus (strain NIH 47885) fusion
glycoprotein F1. Sequence search motif designations
are as in FIG. 20.

FIG. 26. Search results for influenza A
virus (strain A/AICHI/2/68) hemagglutinin precursor
HA2. Sequence search designations are as in FIG. 20.

FIG. 27A-D. Respiratory Syncytial Virus
(RSV) peptide antiviral and circular dichroism data.
FIG. 27A-B: Peptides derived from the F2 DP178/DP107-
like region. Antiviral and CD data. FIG. 27C-D:
Peptides derived from the F1 DP107-like region.
Peptide and CD data.

Antiviral activity (AV) is represented by the
following qualitative symbols:

"-", negative antiviral activity;
"+/-", antiviral activity at greater than 100 µg/ml;
"+", antiviral activity at between 50-100 µg/ml;
"++", antiviral activity at between 20-50 µg/ml;
"+++", antiviral activity at between 1-20 µg/ml;
"++++", antiviral activity at <1 µg/ml.

CD data, referring to the level of helicity, is
represented by the following qualitative symbols:

"-", no helicity;
"+", 25-50% helicity;
"++", 50-75% helicity;
"+++", 75-100% helicity.

IC50 refers to the concentration of peptide
necessary to produce only 50% of the number of
syncytia relative to infected control cultures
containing no peptide. IC50 values were obtained using
purified peptides only.
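
The qualitative scoring scheme above amounts to binning a measured value into labelled ranges. The sketch below shows one way to express it; it is illustrative only. The function names are hypothetical, the handling of values that fall exactly on a range boundary (for example 20 or 50 µg/ml) is an assumption because the ranges as written share endpoints, and treating helicity below 25% as "-" is likewise an assumption.

```python
from typing import Optional


def antiviral_symbol(active_at_ug_per_ml: Optional[float]) -> str:
    """Map the concentration at which antiviral activity is seen to an AV symbol.

    None means no antiviral activity was observed. Boundary values are
    assigned to the more potent bin (assumption).
    """
    if active_at_ug_per_ml is None:
        return "-"
    if active_at_ug_per_ml < 1:
        return "++++"
    if active_at_ug_per_ml <= 20:
        return "+++"
    if active_at_ug_per_ml <= 50:
        return "++"
    if active_at_ug_per_ml <= 100:
        return "+"
    return "+/-"


def helicity_symbol(percent_helicity: float) -> str:
    """Map percent helicity from CD data to a CD symbol (below 25% treated as '-')."""
    if percent_helicity < 25:
        return "-"
    if percent_helicity < 50:
        return "+"
    if percent_helicity < 75:
        return "++"
    return "+++"


print(antiviral_symbol(10), helicity_symbol(60))
```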

FIG. 28A-B. Respiratory Syncytial Virus
(RSV) DP178-like region (F1) peptide antiviral and CD
data. Antiviral symbols, CD symbols, and IC50 are as
in FIG. 27A-D. IC50 values were obtained using
purified peptides only.

FIG. 29A-B. Peptides derived from the HPIV3
F1 DP107-like region. Peptide antiviral and CD data.
Antiviral symbols, CD symbols, and IC50 are as in FIG.
27A-D. Purified peptides were used to obtain IC50
values, except where the values are marked by an
asterisk (*), in which cases the IC50 values were
obtained using a crude peptide preparation.

FIG. 29C. HPIV3 peptide T-184 CD spectrum
at 1°C in 0.1M NaCl, 10mM KPO4, pH 7.0. The data
demonstrates the peptide's helical secondary structure
(θ222/208 = 1.2) over a wide range of concentrations
(100-1500 µM). This evidence is consistent with the peptide
forming a helical coiled-coil structure.

FIG. 30A-B. Peptides derived from the HPIV3
F1 DP178-like region. Peptide antiviral and CD data.
Antiviral symbols, CD symbols, and IC50 are as in FIG.
27A-D. Purified peptides were used to obtain IC50
values, except where the values are marked by an
asterisk (*), in which cases the IC50 values were
obtained using a crude peptide preparation.

FIG. 31. Motif search results for simian
immunodeficiency virus (SIV) isolate MM251, envelope
polyprotein gp41. Sequence search designations are as
in FIG. 20.

FIG. 32. Motif search results for Epstein-
Barr Virus (Strain B95-8), glycoprotein gp110
precursor (designated gp115). BALF4. Sequence search
designations are as in FIG. 20.

FIG. 33. Motif search results for Epstein-
Barr Virus (Strain B95-8), BZLF1 trans-activator
protein (designated EB1 or Zebra). Sequence search
designations are as in FIG. 20. Additionally, "@"
refers to a well known DNA binding domain and "+"
refers to a well known dimerization domain, as defined
by Flemington and Speck (Flemington, E. and Speck,
S.H., 1990, Proc. Natl. Acad. Sci. USA 87:9459-9463).

FIG. 34. Motif search results for measles
virus (strain Edmonston), fusion glycoprotein F1.
Sequence search designations are as in FIG. 20.

FIG. 35. Motif search results for Hepatitis
B Virus (Subtype AYW), major surface antigen precursor
S. Sequence search designations are as in FIG. 20.

FIG. 36. Motif search results for simian
Mason-Pfizer monkey virus, envelope (TM) protein
gp20. Sequence search designations are as in FIG. 20.

FIG. 37. Motif search results for
Pseudomonas aeruginosa, fimbrial protein (Pilin).
Sequence search designations are as in FIG. 20.

FIG. 38. Motif search results for Neisseria 
gonorrhoeae fimbrial protein (Pilin) . Sequence search 
designations are as in FIG. 20. 

FIG. 39. Motif search results for 
Hemophilus influenzae fimbrial protein. Sequence 
search designations are as in FIG. 20. 

FIG. 40. Motif search results for
Staphylococcus aureus, toxic shock syndrome toxin-1.
Sequence search designations are as in FIG. 20.

FIG. 41. Motif search results for 
Staphylococcus aureus enterotoxin Type E. Sequence 
search designations are as in FIG. 20. 

FIG. 42. Motif search results for
Staphylococcus aureus enterotoxin A. Sequence search
designations are as in FIG. 20.

FIG. 43. Motif search results for
Escherichia coli, heat labile enterotoxin A. Sequence
search designations are as in FIG. 20.

FIG. 44. Motif search results for human
c-fos proto-oncoprotein. Sequence search designations
are as in FIG. 20. 

WO 01/51673    PCT/US00/35727

FIG. 45. Motif search results for human 
lupus KU autoantigen protein P70. Sequence search 
designations are as in FIG. 20. 

FIG. 46. Motif search results for human 
zinc finger protein 10. Sequence search designations 
are as in FIG. 20.

FIG. 47. Measles virus (MeV) fusion protein 
DP178-like region antiviral and CD data. Antiviral 
symbols, CD symbols, and IC50 are as in FIG. 27A-D.
IC50 values were obtained using purified peptides.

FIG. 48. Simian immunodeficiency virus
(SIV) TM (fusion) protein DP178-like region antiviral
data. Antiviral symbols are as in FIG. 27A-D. "NT",
not tested.

FIG. 49A-C. DP178-derived peptide antiviral
data. The peptides listed herein were derived from
the region surrounding the HIV-1 BRU isolate DP178
region (e.g., gp41 amino acid residues 615-717).
In instances where peptides contained DP178 point
mutations, the mutated amino acid residues are shown
with a shaded background. In instances in which the
test peptide has had an amino- and/or carboxy-terminal
group added or removed (apart from the standard amido-
and acetyl- blocking groups found on such peptides),
such modifications are indicated. FIG. 49A: The
column to the immediate right of the name of the test
peptide indicates the size of the test peptide and
points out whether the peptide is derived from a one
amino acid peptide "walk" across the DP178 region.
The next column to the right indicates whether the
test peptide contains a point mutation, while the
column to its right indicates whether certain amino
acid residues have been added to or removed from the
DP178-derived amino acid sequence. FIG. 49B: The
column to the immediate right of the test peptide name
indicates whether the peptide represents a DP178
truncation, the next column to the right points out
whether the peptide contains a point mutation, and the
column to its right indicates whether the peptide
contains amino acids which have been added to or
removed from the DP178 sequence itself. FIG. 49C:
The column to the immediate right of the test peptide
name indicates whether the test peptide contains a
point mutation, while the column to its right
indicates whether amino acid residues have been added
to or removed from the DP178 sequence itself. IC50 is
as defined in FIG. 27A-D, and IC50 values were obtained
using purified peptides except where marked with an
asterisk (*), in which case the IC50 was obtained using
a crude peptide preparation.

FIG. 50. DP107 and DP107 gp41 region
truncated peptide antiviral data. IC50 is as defined in
FIG. 27A-D, and IC50 values were obtained using
purified peptides except where marked with an asterisk
(*), in which case the IC50 was obtained using a crude
peptide preparation.

FIG. 51A-B. Epstein-Barr virus Strain B95-8
BZLF1 DP178/DP107 analog region peptide walks and
electrophoretic mobility shift assay results. The
peptides (T-423 to T-446, FIG. 51A; T-447 to T-461,
FIG. 51B) represent one amino acid residue "walks"
through the EBV Zebra protein region from amino acid
residue 173 to 246. The amino acid residue within this
region which corresponds to the first amino acid
residue of each peptide is listed to the left of each
peptide, while the amino acid residue within this
region which corresponds to the last amino acid
residue of each peptide is listed to the right of each
peptide. The length of each test peptide is listed at
the far right of each line, under the heading "Res".
"ACT" refers to a test peptide's ability to inhibit
Zebra binding to its response element. "+" refers to a
visible, but incomplete, abrogation of the response
element/Zebra homodimer complex; "+++" refers to a
complete abrogation of the complex; and "-" represents
a lack of complex disruption.
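
The one-residue "walk" described for FIG. 51A-B can be pictured as a sliding window that advances one residue at a time across the parent region. The Python sketch below is illustrative only. The function name peptide_walk, the fixed window length, and the use of the DP178 sequence with its gp41 numbering (638 to 673) as demonstration input are assumptions made here; they are not the procedure used to prepare the figure.

```python
def peptide_walk(region, first_residue, window):
    """Return (first, peptide, last) for each window, stepping one residue."""
    if window < 1 or window > len(region):
        raise ValueError("window must be between 1 and len(region)")
    walk = []
    for offset in range(len(region) - window + 1):
        first = first_residue + offset           # residue number listed at the left
        peptide = region[offset:offset + window]
        last = first + window - 1                # residue number listed at the right
        walk.append((first, peptide, last))
    return walk


# Demonstration input only (assumption): DP178, gp41 residues 638 to 673.
DP178 = "YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF"

for first, peptide, last in peptide_walk(DP178, 638, 30):
    print(first, peptide, last, len(peptide))
```

A region of n residues walked with a window of w residues yields n - w + 1 peptides, so the 36-residue demonstration region and a 30-residue window give 7 peptides.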

FIG. 52A-B. Hepatitis B virus subtype AYW major 
surface antigen precursor S protein DP178/DP107 analog
region and peptide walks. 52A depicts Domain I (S 
protein amino acid residues 174-220), which contains a 
potential DP178/DP107 analog region. In addition, 
peptides are listed which represent one amino acid 
peptide "walks" through domain I. 52B depicts Domain 
II (S protein amino acid residues 233-291), which 
contains a second potential DP178/DP107 analog region. 
In addition, peptides are listed which represent one 
amino acid peptide "walks" through domain II.

FIG. 53: Cell fusion and competitive inhibition 
data for alanine walk experiments for the DP178-like 
Respiratory Syncytial Virus (RSV) peptide T112.

FIG. 54: Circular dichroism, cell fusion and
competitive inhibition data for alanine walk
experiments for the peptide T20, which is also known
as DP178. 



5. DETAILED DESCRIPTION OF THE INVENTION 
Described herein are peptides which may exhibit
antifusogenic activity, antiviral capability, and/or
the ability to modulate intracellular processes
involving coiled-coil peptide structures. The
peptides described include, first, DP178 (SEQ ID
NO:1), a gp41-derived 36 amino acid peptide, and
fragments and analogs of DP178.

In addition, the peptides of the invention
described herein include peptides which are DP107
analogs. DP107 (SEQ ID NO:25) is a 38 amino acid
peptide corresponding to residues 558 to 595 of the
HIV-1 LAI transmembrane (TM) gp41 protein. Such DP107
analogs may exhibit antifusogenic capability,
antiviral activity or an ability to modulate
intracellular processes involving coiled-coil
structures.

Further, peptides of the invention, including DP107
and DP178 analogs, are described herein having amino acid
sequences recognized by the 107x178x4, ALLMOTI5, and
PLZIP search motifs. Such motifs are also discussed.

Also described here are antifusogenic, antiviral,
intracellular modulatory, and diagnostic uses of the
peptides of the invention. Further, procedures are
described for the use of the peptides of the invention
for the identification of compounds exhibiting
antifusogenic, antiviral or intracellular modulatory
activity.

While not limited to any theory of operation, the
following model is proposed to explain the potent
anti-HIV activity of DP178, based, in part, on the
experiments described in the Examples, infra. In the
HIV protein gp41, DP178 corresponds to a putative
α-helix region located in the C-terminal end of the gp41
ectodomain, and appears to associate with a distal
site on gp41 whose interactive structure is influenced
by the leucine zipper motif, a coiled-coil structure,
referred to as DP107. The association of these two
domains may reflect a molecular linkage or "molecular
clasp" intimately involved in the fusion process. It
is of interest that mutations in the C-terminal
α-helix motif of gp41 (i.e., the DP178 domain) tend to
enhance the fusion ability of gp41, whereas mutations
in the leucine zipper region (i.e., the DP107 domain)
decrease or abolish the fusion ability of the viral
protein. It may be that the leucine zipper motif is
involved in membrane fusion while the C-terminal
α-helix motif serves as a molecular safety to regulate
the availability of the leucine zipper during
virus-induced membrane fusion.

On the basis of the foregoing, two models of
gp41-mediated membrane fusion are proposed, which are
schematically shown in FIG. 11A-B. The reason for
proposing two models is that the temporal nature of
the interaction between the regions defined by DP107
and DP178 cannot, as yet, be pinpointed. Each model
envisions two conformations for gp41: one in a
"native" state as it might be found on a resting
virion, the other in a "fusogenic" state to reflect
conformational changes triggered following binding of
gp120 to CD4 and just prior to fusion with the target
cell membrane. The strong binding affinity between
gp120 and CD4 may actually represent the trigger for
the fusion process, obviating the need for a pH change
such as occurs for viruses that fuse within
intracellular vesicles. The two major features of
both models are: (1) the leucine zipper sequences
(DP107) in each chain of oligomeric envelope are held
apart in the native state and are only allowed access
to one another in the fusogenic state so as to form
the extremely stable coiled-coils, and (2) association
of the DP178 and DP107 sites as they exist in gp41
occurs either in the native or fusogenic state. FIG.
11A depicts DP178/DP107 interaction in the native
state as a molecular clasp. On the other hand, if one
assumes that the most stable form of the envelope
occurs in the fusogenic state, the model in FIG. 11B
can be considered.

When synthesized as peptides, both DP107 and
DP178 are potent inhibitors of HIV infection and
fusion, probably by virtue of their ability to form
complexes with viral gp41 and interfere with its
fusogenic process; e.g., during the structural
transition of the viral protein from the native
structure to the fusogenic state, the DP178 and DP107
peptides may gain access to their respective binding
sites on the viral gp41, and exert a disruptive
influence. DP107 peptides which demonstrate anti-HIV
activity are described in Applicants' co-pending
application Serial No. 08/264,531, filed June 23,
1994, which is incorporated by reference herein in its
entirety.

As shown in the Examples, infra, a truncated
recombinant gp41 protein corresponding to the
ectodomain of gp41 containing both DP107 and DP178
domains (excluding the fusion peptide, transmembrane
region and cytoplasmic domain of gp41) did not inhibit
HIV-1 induced fusion. However, when a single mutation
was introduced to disrupt the coiled-coil structure of
the DP107 domain (a mutation which results in a
total loss of biological activity of DP107 peptides),
the inactive recombinant protein was transformed to an
active inhibitor of HIV-1 induced fusion. This
transformation may result from liberation of the
potent DP178 domain from a molecular clasp with the
leucine zipper, DP107 domain.

For clarity of discussion, the invention will be
described primarily for DP178 peptide inhibitors of
HIV. However, the principles may be analogously
applied to other viruses, both enveloped and
nonenveloped, and to other non-viral organisms.



5.1. DP178 AND DP178-LIKE PEPTIDES
The DP178 peptide (SEQ ID:1) of the invention
corresponds to amino acid residues 638 to 673 of the
transmembrane protein gp41 from the HIV-1 LAI isolate,
and has the 36 amino acid sequence (reading from amino
to carboxy terminus):

NH2-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-COOH (SEQ ID:1)

In addition to the full-length DP178 (SEQ ID:1)
36-mer, the peptides of the invention may include
truncations of the DP178 (SEQ ID:1) peptide which
exhibit antifusogenic activity, antiviral activity
and/or the ability to modulate intracellular processes
involving coiled-coil peptide structures. Truncations
of DP178 (SEQ ID:1) peptides may comprise peptides of
between 3 and 36 amino acid residues (i.e., peptides
ranging in size from a tripeptide to a 36-mer
polypeptide), as shown in Tables I and IA, below.
Peptide sequences in these tables are listed from
amino (left) to carboxy (right) terminus. "X" may
represent an amino group (-NH2) and "Z" may represent a
carboxyl (-COOH) group. Alternatively, "X" may
represent a hydrophobic group, including but not
limited to carbobenzyl, dansyl, or T-butoxycarbonyl;
an acetyl group; a 9-fluorenylmethoxy-carbonyl (FMOC)
group; or a covalently attached macromolecular group,
including but not limited to a lipid-fatty acid
conjugate, polyethylene glycol, carbohydrate or
peptide group. Further, "Z" may represent an amido
group; a T-butoxycarbonyl group; or a covalently
attached macromolecular group, including but not
limited to a lipid-fatty acid conjugate, polyethylene
glycol, carbohydrate or peptide group. A preferred
"X" or "Z" macromolecular group is a peptide group.



TABLE I

DP178 (SEQ ID:1) CARBOXY TRUNCATIONS

X-YTS-Z
X-YTSL-Z
X-YTSLI-Z
X-YTSLIH-Z
X-YTSLIHS-Z
X-YTSLIHSL-Z
X-YTSLIHSLI-Z
X-YTSLIHSLIE-Z
X-YTSLIHSLIEE-Z
X-YTSLIHSLIEES-Z
X-YTSLIHSLIEESQ-Z
X-YTSLIHSLIEESQN-Z
X-YTSLIHSLIEESQNQ-Z
X-YTSLIHSLIEESQNQQ-Z
X-YTSLIHSLIEESQNQQE-Z
X-YTSLIHSLIEESQNQQEK-Z
X-YTSLIHSLIEESQNQQEKN-Z
X-YTSLIHSLIEESQNQQEKNE-Z
X-YTSLIHSLIEESQNQQEKNEQ-Z
X-YTSLIHSLIEESQNQQEKNEQE-Z
X-YTSLIHSLIEESQNQQEKNEQEL-Z
X-YTSLIHSLIEESQNQQEKNEQELL-Z
X-YTSLIHSLIEESQNQQEKNEQELLE-Z
X-YTSLIHSLIEESQNQQEKNEQELLEL-Z
X-YTSLIHSLIEESQNQQEKNEQELLELD-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDK-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKW-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWA-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWAS-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWASL-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLW-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWN-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNW-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z



The one letter amino acid code is used.

TABLE IA

DP178 (SEQ ID:1) AMINO TRUNCATIONS

X-NWF-Z
X-WNWF-Z
X-LWNWF-Z
X-SLWNWF-Z
X-ASLWNWF-Z
X-WASLWNWF-Z
X-KWASLWNWF-Z
X-DKWASLWNWF-Z
X-LDKWASLWNWF-Z
X-ELDKWASLWNWF-Z
X-LELDKWASLWNWF-Z
X-LLELDKWASLWNWF-Z
X-ELLELDKWASLWNWF-Z
X-QELLELDKWASLWNWF-Z
X-EQELLELDKWASLWNWF-Z
X-NEQELLELDKWASLWNWF-Z
X-KNEQELLELDKWASLWNWF-Z
X-EKNEQELLELDKWASLWNWF-Z
X-QEKNEQELLELDKWASLWNWF-Z
X-QQEKNEQELLELDKWASLWNWF-Z
X-NQQEKNEQELLELDKWASLWNWF-Z
X-QNQQEKNEQELLELDKWASLWNWF-Z
X-SQNQQEKNEQELLELDKWASLWNWF-Z
X-ESQNQQEKNEQELLELDKWASLWNWF-Z
X-EESQNQQEKNEQELLELDKWASLWNWF-Z
X-IEESQNQQEKNEQELLELDKWASLWNWF-Z
X-LIEESQNQQEKNEQELLELDKWASLWNWF-Z
X-SLIEESQNQQEKNEQELLELDKWASLWNWF-Z
X-HSLIEESQNQQEKNEQELLELDKWASLWNWF-Z
X-IHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z
X-LIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z
X-SLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z
X-TSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z
X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z



The one letter amino acid code is used. 
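
The rows of Tables I and IA follow a regular pattern: every peptide of 3 to 36 residues that keeps one terminus of DP178 intact. As a minimal sketch of that pattern, the Python below regenerates both lists from the SEQ ID:1 sequence. The function names and the min_len parameter are introduced here for illustration and are not part of the disclosure; "X" and "Z" are kept as literal placeholders for the terminal groups.

```python
DP178 = "YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF"  # SEQ ID:1, amino to carboxy


def carboxy_truncations(seq, min_len=3):
    """Peptides that keep the amino terminus and lose carboxy-terminal residues."""
    return [seq[:n] for n in range(min_len, len(seq) + 1)]


def amino_truncations(seq, min_len=3):
    """Peptides that keep the carboxy terminus and lose amino-terminal residues."""
    return [seq[-n:] for n in range(min_len, len(seq) + 1)]


def as_table_rows(peptides, x="X", z="Z"):
    """Format peptides with the X and Z terminal-group placeholders."""
    return [f"{x}-{p}-{z}" for p in peptides]


table_i = as_table_rows(carboxy_truncations(DP178))
table_ia = as_table_rows(amino_truncations(DP178))

print(len(table_i), table_i[0], table_i[-1])
print(len(table_ia), table_ia[0], table_ia[-1])
```

Each list has 34 entries (lengths 3 through 36), and the final entry of both is the full-length 36-mer. The same two functions applied to another parent sequence give the corresponding truncation series for that peptide.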

The peptides of the invention also include
DP178-like peptides. "DP178-like", as used herein, refers,
first, to DP178 and DP178 truncations which contain
one or more amino acid substitutions, insertions
and/or deletions. Second, "DP178-like" refers to
peptide sequences identified or recognized by the
ALLMOTI5, 107x178x4 and PLZIP search motifs described
herein, having structural and/or amino acid motif
similarity to DP178. The DP178-like peptides of the
invention may exhibit antifusogenic or antiviral
activity, or may exhibit the ability to modulate
intracellular processes involving coiled-coil
peptides. Further, such DP178-like peptides may
possess additional advantageous features, such as, for
example, increased bioavailability, and/or stability,
or reduced host immune recognition.

HIV-1 and HIV-2 envelope proteins are
structurally distinct, but there exists a striking
amino acid conservation within the DP178-corresponding
regions of HIV-1 and HIV-2. The amino acid
conservation is of a periodic nature, suggesting some
conservation of structure and/or function. Therefore,
one possible class of amino acid substitutions would
include those amino acid changes which are predicted
to stabilize the structure of the DP178 peptides of
the invention. Utilizing the DP178 and DP178 analog
sequences described herein, the skilled artisan can
readily compile DP178 consensus sequences and
ascertain from these conserved amino acid residues
which would represent preferred amino acid
substitutions.

The amino acid substitutions may be of a
conserved or non-conserved nature. Conserved amino
acid substitutions consist of replacing one or more
amino acids of the DP178 (SEQ ID:1) peptide sequence
with amino acids of similar charge, size, and/or
hydrophobicity characteristics, such as, for example,
a glutamic acid (E) to aspartic acid (D) amino acid
substitution. Non-conserved substitutions consist of
replacing one or more amino acids of the DP178 (SEQ
ID:1) peptide sequence with amino acids possessing
dissimilar charge, size, and/or hydrophobicity
characteristics, such as, for example, a glutamic acid
(E) to valine (V) substitution.
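
The distinction between conserved and non-conserved substitutions can be illustrated with a simple side-chain lookup. The sketch below is an assumption-laden illustration, not a definition taken from this disclosure: the grouping in SIDE_CHAIN_CLASS is one common textbook partition chosen here for convenience, and the function names are hypothetical. It reproduces the two examples given in the text, E to D (conserved) and E to V (non-conserved).

```python
# Illustrative grouping only (assumption); other partitions are equally defensible.
SIDE_CHAIN_CLASS = {
    "D": "acidic", "E": "acidic",
    "K": "basic", "R": "basic", "H": "basic",
    "S": "polar", "T": "polar", "N": "polar", "Q": "polar",
    "A": "aliphatic", "V": "aliphatic", "L": "aliphatic",
    "I": "aliphatic", "M": "aliphatic",
    "F": "aromatic", "W": "aromatic", "Y": "aromatic",
    "C": "cysteine", "G": "glycine", "P": "proline",
}


def is_conserved(original, replacement):
    """True when two different residues fall in the same side-chain class."""
    return (original != replacement
            and SIDE_CHAIN_CLASS[original] == SIDE_CHAIN_CLASS[replacement])


def substitute(seq, position, replacement):
    """Return seq with the residue at a 1-based position replaced."""
    if not 1 <= position <= len(seq):
        raise ValueError("position outside the peptide")
    return seq[:position - 1] + replacement + seq[position:]


DP178 = "YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF"  # SEQ ID:1

# Position 10 of DP178 is glutamic acid (E).
for new_residue in ("D", "V"):
    variant = substitute(DP178, 10, new_residue)
    kind = "conserved" if is_conserved(DP178[9], new_residue) else "non-conserved"
    print(f"E10{new_residue}: {kind}: {variant}")
```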

Amino acid insertions may consist of single amino
acid residues or stretches of residues. The
insertions may be made at the carboxy or amino
terminal end of the DP178 or DP178 truncated peptides,
as well as at a position internal to the peptide.
Such insertions will generally range from 2 to 15
amino acids in length. It is contemplated that
insertions made at either the carboxy or amino
terminus of the peptide of interest may be of a
broader size range, with about 2 to about 50 amino
acids being preferred. One or more such insertions
may be introduced into DP178 (SEQ ID:1) or DP178
truncations, as long as such insertions result in
peptides which may still be recognized by the
107x178x4, ALLMOTI5 or PLZIP search motifs described
herein, or may, alternatively, exhibit antifusogenic
or antiviral activity, or exhibit the ability to
modulate intracellular processes involving coiled-coil
peptide structures.

Preferred amino or carboxy terminal insertions
are peptides ranging from about 2 to about 50 amino
acid residues in length, corresponding to gp41 protein
regions either amino to or carboxy to the actual DP178
gp41 amino acid sequence, respectively. Thus, a
preferred amino terminal or carboxy terminal amino
acid insertion would contain gp41 amino acid sequences
found immediately amino to or carboxy to the DP178
region of the gp41 protein.
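
A terminal insertion of this kind amounts to widening the window taken from the parent protein. The following Python sketch shows the idea under stated assumptions: the function name extend_with_flanks is hypothetical, and the flanking residues (runs of A and G) are placeholders chosen so that they are easy to see in the output. They are not gp41 sequence.

```python
def extend_with_flanks(parent, start, end, n_amino=0, n_carboxy=0):
    """Return parent residues start..end (1-based, inclusive) plus flanks.

    n_amino residues immediately amino to the region and n_carboxy residues
    immediately carboxy to it are included; requests that run past either end
    of the parent are clipped to the parent boundaries.
    """
    if not 1 <= start <= end <= len(parent):
        raise ValueError("region must lie within the parent sequence")
    if n_amino < 0 or n_carboxy < 0:
        raise ValueError("flank lengths must be non-negative")
    lo = max(1, start - n_amino)
    hi = min(len(parent), end + n_carboxy)
    return parent[lo - 1:hi]


DP178 = "YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF"  # SEQ ID:1

# Placeholder flanks (assumption): NOT the real gp41 neighbouring residues.
AMINO_FLANK = "AAAAAA"
CARBOXY_FLANK = "GGGGGG"
parent = AMINO_FLANK + DP178 + CARBOXY_FLANK
start = len(AMINO_FLANK) + 1
end = start + len(DP178) - 1

print(extend_with_flanks(parent, start, end, n_amino=2))
print(extend_with_flanks(parent, start, end, n_carboxy=3))
```

With a real parent sequence, start and end would be the residue numbers of the region in that protein's own numbering.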

Deletions of DP178 (SEQ ID:1) or DP178
truncations are also within the scope of the
invention. Such deletions consist of the removal of
one or more amino acids from the DP178 or DP178-like
peptide sequence, with the lower limit length of the
resulting peptide sequence being 4 to 6 amino acids.
Such deletions may involve a single contiguous portion
or more than one discrete portion of the peptide
sequence. One or more such deletions may be
introduced into DP178 (SEQ ID:1) or DP178 truncations,
as long as such deletions result in peptides which may
still be recognized by the 107x178x4, ALLMOTI5 or
PLZIP search motifs described herein, or may,
alternatively, exhibit antifusogenic or antiviral
activity, or exhibit the ability to modulate
intracellular processes involving coiled-coil peptide
structures.
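
The deletion rule above (one contiguous portion or several discrete portions, subject to a lower length limit) can be sketched as follows. This is a hypothetical helper written for illustration: the name delete_portions, the (start, length) convention, and the default min_len of 4 (the low end of the stated 4 to 6 range) are assumptions made here.

```python
def delete_portions(seq, portions, min_len=4):
    """Remove one or more discrete portions from a peptide.

    portions is a list of (start, length) pairs with 1-based start positions.
    Raises ValueError if a portion falls outside the peptide or if the
    resulting peptide would be shorter than min_len residues.
    """
    removed = set()
    for start, length in portions:
        if length < 1 or start < 1 or start + length - 1 > len(seq):
            raise ValueError(f"portion {(start, length)} lies outside the peptide")
        removed.update(range(start, start + length))
    result = "".join(res for pos, res in enumerate(seq, start=1)
                     if pos not in removed)
    if len(result) < min_len:
        raise ValueError("resulting peptide is below the lower length limit")
    return result


DP178 = "YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF"  # SEQ ID:1

print(delete_portions(DP178, [(1, 3)]))            # single contiguous portion
print(delete_portions(DP178, [(1, 2), (35, 2)]))   # two discrete portions
```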

DP178 analogs are further described, below, in
Section 5.3.



5.2. DP107 AND DP107-LIKE PEPTIDES 
Further, the peptides of the invention include
peptides having amino acid sequences corresponding to
DP107 analogs. DP107 is a 38 amino acid peptide which
exhibits potent antiviral activity, and corresponds to
residues 558 to 595 of HIV-1 LAI transmembrane (TM) gp41
protein, as shown here:



NH2-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-COOH (SEQ ID:25)

In addition to the full-length DP107 (SEQ ID:25)
38-mer, the peptides of the invention may include
truncations of the DP107 (SEQ ID:25) peptide which
exhibit antifusogenic activity, antiviral activity
and/or the ability to modulate intracellular processes
involving coiled-coil peptide structures. Truncations
of DP107 (SEQ ID:25) peptides may comprise peptides of
between 3 and 38 amino acid residues (i.e., peptides
ranging in size from a tripeptide to a 38-mer
polypeptide), as shown in Tables II and IIA, below.
Peptide sequences in these tables are listed from
amino (left) to carboxy (right) terminus. "X" may
represent an amino group (-NH2) and "Z" may represent a
carboxyl (-COOH) group. Alternatively, "X" may
represent a hydrophobic group, including but not
limited to carbobenzyl, dansyl, or T-butoxycarbonyl;
an acetyl group; a 9-fluorenylmethoxy-carbonyl (FMOC)
group; or a covalently attached macromolecular group,
including but not limited to a lipid-fatty acid
conjugate, polyethylene glycol, carbohydrate or
peptide group. Further, "Z" may represent an amido
group; a T-butoxycarbonyl group; or a covalently
attached macromolecular group, including but not
limited to a lipid-fatty acid conjugate, polyethylene
glycol, carbohydrate or peptide group. A preferred
"X" or "Z" macromolecular group is a peptide group.

TABLE II

DP107 (SEQ ID:25) CARBOXY TRUNCATIONS

X-NNL-Z
X-NNLL-Z
X-NNLLR-Z
X-NNLLRA-Z
X-NNLLRAI-Z
X-NNLLRAIE-Z
X-NNLLRAIEA-Z
X-NNLLRAIEAQ-Z
X-NNLLRAIEAQQ-Z
X-NNLLRAIEAQQH-Z
X-NNLLRAIEAQQHL-Z
X-NNLLRAIEAQQHLL-Z
X-NNLLRAIEAQQHLLQ-Z
X-NNLLRAIEAQQHLLQL-Z
X-NNLLRAIEAQQHLLQLT-Z
X-NNLLRAIEAQQHLLQLTV-Z
X-NNLLRAIEAQQHLLQLTVW-Z
X-NNLLRAIEAQQHLLQLTVWQ-Z
X-NNLLRAIEAQQHLLQLTVWQI-Z
X-NNLLRAIEAQQHLLQLTVWQIK-Z
X-NNLLRAIEAQQHLLQLTVWQIKQ-Z
X-NNLLRAIEAQQHLLQLTVWQIKQL-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQ-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQA-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQAR-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARI-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARIL-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILA-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAV-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVE-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVER-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERY-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYL-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLK-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLKD-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z



The one letter amino acid code is used.

TABLE IIA

DP107 (SEQ ID:25) AMINO TRUNCATIONS

X-KDQ-Z
X-LKDQ-Z
X-YLKDQ-Z
X-RYLKDQ-Z
X-ERYLKDQ-Z
X-VERYLKDQ-Z
X-AVERYLKDQ-Z
X-LAVERYLKDQ-Z
X-ILAVERYLKDQ-Z
X-RILAVERYLKDQ-Z
X-ARILAVERYLKDQ-Z
X-QARILAVERYLKDQ-Z
X-LQARILAVERYLKDQ-Z
X-QLQARILAVERYLKDQ-Z
X-KQLQARILAVERYLKDQ-Z
X-IKQLQARILAVERYLKDQ-Z
X-QIKQLQARILAVERYLKDQ-Z
X-WQIKQLQARILAVERYLKDQ-Z
X-VWQIKQLQARILAVERYLKDQ-Z
X-TVWQIKQLQARILAVERYLKDQ-Z
X-LTVWQIKQLQARILAVERYLKDQ-Z
X-QLTVWQIKQLQARILAVERYLKDQ-Z
X-LQLTVWQIKQLQARILAVERYLKDQ-Z
X-LLQLTVWQIKQLQARILAVERYLKDQ-Z
X-HLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-QHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-QQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-AQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-EAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-IEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-AIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-RAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-LRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-LLRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-NLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z



The one letter amino acid code is used.

The peptides of the invention also include
DP107-like peptides. "DP107-like", as used herein, refers,
first, to DP107 and DP107 truncations which contain
one or more amino acid substitutions, insertions
and/or deletions. Second, "DP107-like" refers to
peptide sequences identified or recognized by the
ALLMOTI5, 107x178x4 and PLZIP search motifs described
herein, having structural and/or amino acid motif
similarity to DP107. The DP107-like peptides of the
invention may exhibit antifusogenic or antiviral
activity, or may exhibit the ability to modulate
intracellular processes involving coiled-coil
peptides. Further, such DP107-like peptides may
possess additional advantageous features, such as, for
example, increased bioavailability, and/or stability,
or reduced host immune recognition.

HIV-1 and HIV-2 envelope proteins are
structurally distinct, but there exists a striking
amino acid conservation within the DP107-corresponding
regions of HIV-1 and HIV-2. The amino acid
conservation is of a periodic nature, suggesting some
conservation of structure and/or function. Therefore,
one possible class of amino acid substitutions would
include those amino acid changes which are predicted
to stabilize the structure of the DP107 peptides of
the invention. Utilizing the DP107 and DP107 analog
sequences described herein, the skilled artisan can
readily compile DP107 consensus sequences and
ascertain from these conserved amino acid residues
which would represent preferred amino acid
substitutions.

The amino acid substitutions may be of a
conserved or non-conserved nature. Conserved amino
acid substitutions consist of replacing one or more
amino acids of the DP107 (SEQ ID:25) peptide sequence
with amino acids of similar charge, size, and/or
hydrophobicity characteristics, such as, for example,
a glutamic acid (E) to aspartic acid (D) amino acid
substitution. Non-conserved substitutions consist of
replacing one or more amino acids of the DP107 (SEQ
ID:25) peptide sequence with amino acids possessing
dissimilar charge, size, and/or hydrophobicity
characteristics, such as, for example, a glutamic acid
(E) to valine (V) substitution.

Amino acid insertions may consist of single amino
acid residues or stretches of residues. The
insertions may be made at the carboxy or amino
terminal end of the DP107 or DP107 truncated peptides,
as well as at a position internal to the peptide.
Such insertions will generally range from 2 to 15
amino acids in length. It is contemplated that
insertions made at either the carboxy or amino
terminus of the peptide of interest may be of a
broader size range, with about 2 to about 50 amino
acids being preferred. One or more such insertions
may be introduced into DP107 (SEQ ID:25) or DP107
truncations, as long as such insertions result in
peptides which may still be recognized by the
107x178x4, ALLMOTI5 or PLZIP search motifs described
herein, or may, alternatively, exhibit antifusogenic
or antiviral activity, or exhibit the ability to
modulate intracellular processes involving coiled-coil
peptide structures.

Preferred amino or carboxy terminal insertions
are peptides ranging from about 2 to about 50 amino
acid residues in length, corresponding to gp41 protein
regions either amino to or carboxy to the actual DP107
gp41 amino acid sequence, respectively. Thus, a
preferred amino terminal or carboxy terminal amino
acid insertion would contain gp41 amino acid sequences
found immediately amino to or carboxy to the DP107
region of the gp41 protein.

Deletions of DP107 (SEQ ID:25) or DP107
truncations are also within the scope of the
invention. Such deletions consist of the removal of
one or more amino acids from the DP107 or DP107-like
peptide sequence, with the lower limit length of the
resulting peptide sequence being 4 to 6 amino acids.
Such deletions may involve a single contiguous portion
or more than one discrete portion of the peptide
sequence. One or more such deletions may be
introduced into DP107 (SEQ ID:25) or DP107
truncations, as long as such deletions result in
peptides which may still be recognized by the
107x178x4, ALLMOTI5 or PLZIP search motifs described
herein, or may, alternatively, exhibit antifusogenic
or antiviral activity, or exhibit the ability to
modulate intracellular processes involving coiled-coil
peptide structures.

DP107 and DP107 truncations are more fully
described in Applicants' co-pending U.S. Patent
Application Ser. No. 08/374,666, filed January 27,
1995, which is incorporated herein by reference in
its entirety. DP107 analogs are further described,
below, in Section 5.3.

5.3. DP107 and DP178 ANALOGS

Peptides corresponding to analogs of the DP178,
DP178 truncation, DP107 and DP107 truncation
sequences of the invention, described, above, in
Sections 5.1 and 5.2 may be found in other viruses,
including, for example, non-HIV-1 LAI enveloped viruses,
non-enveloped viruses and other non-viral organisms.

The term "analog", as used herein, refers to a
peptide which is recognized or identified via the
107x178x4, ALLMOTI5 and/or PLZIP search strategies
discussed below. Further, such peptides may exhibit
antifusogenic capability, antiviral activity, or the
ability to modulate intracellular processes involving
coiled-coil structures.

Such DP178 and DP107 analogs may, for example,
correspond to peptide sequences present in TM proteins
of enveloped viruses and may, additionally, correspond
to peptide sequences present in non-enveloped viruses
and non-viral organisms. Such peptides may exhibit
antifusogenic activity, antiviral activity, most
particularly antiviral activity which is specific to
the virus in which their native sequences are found,
or may exhibit an ability to modulate intracellular
processes involving coiled-coil peptide structures.

DP178 analogs are peptides whose amino acid
sequences are comprised of the amino acid sequences of
peptide regions of, for example, other (i.e., other
than HIV-1 LAI) viruses that correspond to the gp41
peptide region from which DP178 (SEQ ID:1) was
derived. Such viruses may include, but are not
limited to, other HIV-1 isolates and HIV-2 isolates.
DP178 analogs derived from the corresponding gp41
peptide region of other (i.e., non HIV-1 LAI) HIV-1
isolates may include, for example, peptide sequences
as shown below.

NH2-YTNTIYTLLEESQNQQEKNEQELLELDKWASLWNWF-COOH (DP-185; SEQ ID:3);

NH2-YTGIIYNLLEESQNQQEKNEQELLELDKWANLWNWF-COOH (SEQ ID:4);

NH2-YTSLIYSLLEKSQIQQEKNEQELLELDKWASLWNWF-COOH (SEQ ID:5).

SEQ ID:3 (DP-185), SEQ ID:4, and SEQ ID:5 are derived 
from HIV-1 SF2, HIV-1 RF, and HIV-1 MN isolates, 
respectively. Underlined amino acid residues refer to 
those residues that differ from the corresponding 
position in the DP178 (SEQ ID:1) peptide. One such 
DP178 analog, DP-185 (SEQ ID:3), is described in the 
Example presented in Section 6, below, where it is 
demonstrated that DP-185 (SEQ ID:3) exhibits antiviral 
activity. The DP178 analogs of the invention may also 
include truncations, as described above. Further, the 
analogs of the invention may include modifications such 
as those described for DP178 analogs in Section 5.1., 
above. Although it is preferred that the DP178 analogs 
of the invention represent peptides whose amino acid 
sequences correspond to the DP178 region of the gp41 
protein, it is also contemplated that the peptides of 
the invention may, additionally, include amino acid 
sequences, ranging from about 2 to about 50 amino acid 
residues in length, corresponding to gp41 protein 
regions either amino to or carboxy to the actual DP178 
amino acid sequence. 
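
The residues that differ between DP178 and each analog can be located by a simple position-by-position comparison. The following Python sketch is illustrative only: the DP178 sequence used is the one listed for peptide No. 20 in Table V (assumed here to correspond to SEQ ID:1), and the function name `differing_positions` is hypothetical.

```python
# DP178 sequence as listed for peptide No. 20 in Table V (assumed to be SEQ ID:1).
DP178 = "YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF"

# DP178 analogs from other HIV-1 isolates, as given in the text above.
ANALOGS = {
    "SEQ ID:3 (DP-185)": "YTNTIYTLLEESQNQQEKNEQELLELDKWASLWNWF",
    "SEQ ID:4": "YTGIIYNLLEESQNQQEKNEQELLELDKWANLWNWF",
    "SEQ ID:5": "YTSLIYSLLEKSQIQQEKNEQELLELDKWASLWNWF",
}


def differing_positions(reference, analog):
    """Return (1-based position, reference residue, analog residue) for each mismatch."""
    if len(reference) != len(analog):
        raise ValueError("sequences must have the same length")
    return [
        (i + 1, ref, alt)
        for i, (ref, alt) in enumerate(zip(reference, analog))
        if ref != alt
    ]


if __name__ == "__main__":
    for name, seq in ANALOGS.items():
        diffs = differing_positions(DP178, seq)
        summary = ", ".join(f"{ref}{pos}{alt}" for pos, ref, alt in diffs)
        print(f"{name}: {len(diffs)} differences ({summary})")
```

The comparison assumes the analogs are aligned to DP178 without gaps, which holds for the three 36-residue sequences shown.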
Striking similarities, as shown in FIG. 1, exist 
within the regions of HIV-1 and HIV-2 isolates which 
correspond to the DP178 sequence. A DP178 analog 
derived from the HIV-2 NIHZ isolate has the 36 amino 
acid sequence (reading from amino to carboxy terminus): 

NH2-LEANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-COOH (SEQ ID:7) 
Table III and Table IV show some possible truncations 
of the HIV-2 NIHZ DP178 analog, which may comprise 
peptides of between 3 and 36 amino acid residues 
(i.e., peptides ranging in size from a tripeptide to a 
36-mer polypeptide). Peptide sequences in these 
tables are listed from amino (left) to carboxy (right) 
terminus. "X" may represent an amino group (-NH2) and 
"Z" may represent a carboxyl (-COOH) group. 
Alternatively, "X" may represent a hydrophobic group, 
including but not limited to carbobenzyl, dansyl, or 
T-butoxycarbonyl; an acetyl group; a 
9-fluorenylmethoxy-carbonyl (FMOC) group; or a 
covalently attached macromolecular group, including 
but not limited to a lipid-fatty acid conjugate, 
polyethylene glycol, carbohydrate or peptide group. 
Further, "Z" may represent an amido group; a 
T-butoxycarbonyl group; or a covalently attached 
macromolecular group, including but not limited to a 
lipid-fatty acid conjugate, polyethylene glycol, 
carbohydrate or peptide group. A preferred "X" or "Z" 
macromolecular group is a peptide group. 



TABLE III 

HIV-2 NIHZ DP178 analog carboxy truncations. 

X-LEA-Z 
X-LEAN-Z 
X-LEANI-Z 
X-LEANIS-Z 
X-LEANISQ-Z 
X-LEANISQS-Z 
X-LEANISQSL-Z 
X-LEANISQSLE-Z 
X-LEANISQSLEQ-Z 
X-LEANISQSLEQA-Z 
X-LEANISQSLEQAQ-Z 
X-LEANISQSLEQAQI-Z 
X-LEANISQSLEQAQIQ-Z 
X-LEANISQSLEQAQIQQ-Z 
X-LEANISQSLEQAQIQQE-Z 
X-LEANISQSLEQAQIQQEK-Z 
X-LEANISQSLEQAQIQQEKN-Z 
X-LEANISQSLEQAQIQQEKNM-Z 
X-LEANISQSLEQAQIQQEKNMY-Z 
X-LEANISQSLEQAQIQQEKNMYE-Z 
X-LEANISQSLEQAQIQQEKNMYEL-Z 
X-LEANISQSLEQAQIQQEKNMYELQ-Z 
X-LEANISQSLEQAQIQQEKNMYELQK-Z 
X-LEANISQSLEQAQIQQEKNMYELQKL-Z 
X-LEANISQSLEQAQIQQEKNMYELQKLN-Z 
X-LEANISQSLEQAQIQQEKNMYELQKLNS-Z 
X-LEANISQSLEQAQIQQEKNMYELQKLNSW-Z 
X-LEANISQSLEQAQIQQEKNMYELQKLNSWD-Z 
X-LEANISQSLEQAQIQQEKNMYELQKLNSWDV-Z 
X-LEANISQSLEQAQIQQEKNMYELQKLNSWDVF-Z 
X-LEANISQSLEQAQIQQEKNMYELQKLNSWDVFT-Z 
X-LEANISQSLEQAQIQQEKNMYELQKLNSWDVFTN-Z 
X-LEANISQSLEQAQIQQEKNMYELQKLNSWDVFTNW-Z 
X-LEANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z 

The one letter amino acid code is used. 
TABLE IV 

HIV-2 NIHZ DP178 analog amino truncations. 

X-NWL-Z 
X-TNWL-Z 
X-FTNWL-Z 
X-VFTNWL-Z 
X-DVFTNWL-Z 
X-WDVFTNWL-Z 
X-SWDVFTNWL-Z 
X-NSWDVFTNWL-Z 
X-LNSWDVFTNWL-Z 
X-KLNSWDVFTNWL-Z 
X-QKLNSWDVFTNWL-Z 
X-LQKLNSWDVFTNWL-Z 
X-ELQKLNSWDVFTNWL-Z 
X-YELQKLNSWDVFTNWL-Z 
X-MYELQKLNSWDVFTNWL-Z 
X-NMYELQKLNSWDVFTNWL-Z 
X-KNMYELQKLNSWDVFTNWL-Z 
X-EKNMYELQKLNSWDVFTNWL-Z 
X-QEKNMYELQKLNSWDVFTNWL-Z 
X-QQEKNMYELQKLNSWDVFTNWL-Z 
X-IQQEKNMYELQKLNSWDVFTNWL-Z 
X-QIQQEKNMYELQKLNSWDVFTNWL-Z 
X-AQIQQEKNMYELQKLNSWDVFTNWL-Z 
X-QAQIQQEKNMYELQKLNSWDVFTNWL-Z 
X-EQAQIQQEKNMYELQKLNSWDVFTNWL-Z 
X-LEQAQIQQEKNMYELQKLNSWDVFTNWL-Z 
X-SLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z 
X-QSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z 
X-SQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z 
X-ISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z 
X-NISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z 
X-ANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z 
X-EANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z 
X-LEANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-Z 

The one letter amino acid code is used. 
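
The entries of Tables III and IV follow mechanically from the 36-residue HIV-2 analog sequence (SEQ ID:7). As a minimal sketch, the Python below regenerates both series; the function names and the `min_len` parameter are hypothetical, and "X" and "Z" are kept as literal placeholders for the terminal groups described above.

```python
# HIV-2 DP178 analog (SEQ ID:7), amino to carboxy terminus.
HIV2_ANALOG = "LEANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWL"


def carboxy_truncations(seq, min_len=3):
    """Peptides that keep the amino terminus and are shortened from the carboxy end."""
    return [seq[:n] for n in range(min_len, len(seq) + 1)]


def amino_truncations(seq, min_len=3):
    """Peptides that keep the carboxy terminus and are shortened from the amino end."""
    return [seq[-n:] for n in range(min_len, len(seq) + 1)]


def format_entry(peptide, x="X", z="Z"):
    """Render a peptide in the X-...-Z notation used in the tables."""
    return f"{x}-{peptide}-{z}"


if __name__ == "__main__":
    print("Carboxy truncations (Table III):")
    for peptide in carboxy_truncations(HIV2_ANALOG):
        print(format_entry(peptide))
    print("Amino truncations (Table IV):")
    for peptide in amino_truncations(HIV2_ANALOG):
        print(format_entry(peptide))
```

Each series contains 34 peptides, from the tripeptide to the full 36-mer.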

DP178 and DP107 analogs are recognized or 
identified, for example, by utilizing one or more of 
the 107x178x4, ALLMOTI5 or PLZIP computer-assisted 
search strategies described and demonstrated, below, 
in the Examples presented in Sections 9 through 16 and 
19 through 25. The search strategy identifies 
additional peptide regions which are predicted to have 
structural and/or amino acid sequence features similar 
to those of DP107 and/or DP178. 

The search strategies are described fully, below, 
in the Example presented in Section 9. While this 
search strategy is based, in part, on a primary amino 
acid motif deduced from DP107 and DP178, it is not 
based solely on searching for primary amino acid 
sequence homologies, as such protein sequence 
homologies exist within, but not between, major groups 
of viruses. For example, primary amino acid sequence 
homology is high within the TM protein of different 
strains of HIV-1 or within the TM protein of different 
isolates of simian immunodeficiency virus (SIV). 
Primary amino acid sequence homology between HIV-1 and 
SIV, however, is low enough so as not to be useful. 
It is not possible, therefore, to find peptide regions 
similar to DP107 or DP178 within other viruses, or 
within non-viral organisms, whether structurally or 
otherwise, based on primary sequence homology alone. 
Further, while it would be potentially useful to 
identify primary sequence arrangements of amino acids 
based on, for example, the physical chemical 
characteristics of different classes of amino acids 
rather than based on the specific amino acids 
themselves, such search strategies have, until now, 
proven inadequate. For example, a computer algorithm 
designed by Lupas et al. to identify coiled-coil 
propensities of regions within proteins (Lupas, A., et 
al., 1991 Science 252:1162-1164) is inadequate for 
identifying protein regions analogous to DP107 or 
DP178. 

Specifically, analysis of HIV-1 gp160 (containing 
both gp120 and gp41) using the Lupas algorithm does 
not identify the coiled-coil region within DP107. It 
does, however, identify a region within DP178 
beginning eight amino acids N-terminal to the start of 
DP178 and ending eight amino acids from the 
C-terminus. The DP107 peptide has been shown 
experimentally to form a stable coiled coil. A search 
based on the Lupas search algorithm, therefore, would 
not have identified the DP107 coiled-coil region. 
Conversely, the Lupas algorithm identified the DP178 
region as a potential coiled-coil motif. However, the 
peptide derived from the DP178 region failed to form a 
coiled coil in solution. 

A possible explanation for the inability of the 
Lupas search algorithm to accurately identify 
coiled-coil sequences within the HIV-1 TM is that the 
Lupas algorithm is based on the structure of coiled 
coils from proteins that are not structurally or 
functionally similar to the TM proteins of viruses, 
antiviral peptides (e.g., DP107 and DP178) of which are 
an object of this invention. 

The computer search strategy of the invention, as 
demonstrated in the Examples presented below, in 
Sections 9 through 16 and 19 through 25, successfully 
identifies regions of proteins similar to DP107 or 
DP178. This search strategy was designed to be used 
with a commercially-available sequence database 
package, preferably PC/Gene. 
A series of search motifs, the 107x178x4, 
ALLMOTI5 and PLZIP motifs, were designed and 
engineered to range in stringency from strict to 
broad, as discussed in this Section and in Section 9, 
with 107x178x4 being preferred. The sequences 
identified via such search motifs, such as those 
listed in Tables V-XIV, below, potentially exhibit 
antifusogenic, such as antiviral, activity, may 
additionally be useful in the identification of 
antifusogenic, such as antiviral, compounds, and are 
intended to be within the scope of the invention. 

Coiled-coil sequences are thought to consist of 
heptad amino acid repeats. For ease of description, 
the amino acid positions within the heptad repeats are 
sometimes referred to as A through G, with the first 
position being A, the second B, etc. The motifs used 
to identify DP107-like and DP178-like sequences herein 
are designed to specifically search for and identify 
such heptad repeats. In the descriptions of each of 
the motifs described, below, amino acids enclosed by 
brackets, i.e., [], designate the only amino acid 
residues that are acceptable at the given position, 
while amino acids enclosed by braces, i.e., {}, 
designate the only amino acids which are unacceptable 
at the given heptad position. When a set of bracketed 
or braced amino acids is followed by a number in 
parentheses, i.e., (), it refers to the number of 
subsequent amino acid positions for which the 
designated set of amino acids holds, e.g., a (2) means 
"for the next two heptad amino acid positions". 

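The bracket, brace and parenthesis notation described above maps directly onto character classes and repeat counts in a regular expression. The following Python sketch shows one possible translation; it is not the PC/Gene implementation, and the function name `motif_to_regex` is hypothetical.

```python
import re

# One token of the motif notation: [allowed], {excluded}, (count) or a '-' separator.
TOKEN = re.compile(r"\[([A-Z]+)\]|\{([A-Z]+)\}|\((\d+)\)|-")


def motif_to_regex(motif):
    """Translate a motif such as '[EFIK]-{CFMP}(2)-' into a regular expression string."""
    motif = re.sub(r"\s+", "", motif)  # ignore spaces and line breaks
    parts = []
    pos = 0
    for match in TOKEN.finditer(motif):
        if match.start() != pos:
            raise ValueError(f"unrecognized text at offset {pos}: {motif[pos:match.start()]!r}")
        pos = match.end()
        allowed, excluded, count = match.groups()
        if allowed:
            parts.append(f"[{allowed}]")        # only these residues are acceptable
        elif excluded:
            parts.append(f"[^{excluded}]")      # any residue except these
        elif count:
            parts.append("{" + count + "}")     # applies to this many positions
        # a '-' separator contributes nothing to the expression
    if pos != len(motif):
        raise ValueError(f"unrecognized text at offset {pos}: {motif[pos:]!r}")
    return "".join(parts)


if __name__ == "__main__":
    print(motif_to_regex("{CDGHP}-{CFP}(2)-{CDGHP}-{CFP}(3)-"))
```

Note that a negated class such as `[^CFP]` also matches characters that are not amino acid codes, so the sketch assumes the searched sequence contains only upper-case one-letter residue codes.
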

The ALLMOTI5 motif is written as follows: 

{CDGHP}-{CFP}(2)-{CDGHP}-{CFP}(3)- 
{CDGHP}-{CFP}(2)-{CDGHP}-{CFP}(3)- 
{CDGHP}-{CFP}(2)-{CDGHP}-{CFP}(3)- 
{CDGHP}-{CFP}(2)-{CDGHP}-{CFP}(3)- 
{CDGHP}-{CFP}(2)-{CDGHP}-{CFP}(3)- 

Translating this motif, it would read: "at the 
first (A) position of the heptad, any amino acid 
residue except C, D, G, H, or P is acceptable; at the 
next two (B, C) amino acid positions, any amino acid 
residue except C, F, or P is acceptable; at the fourth 
heptad position (D), any amino acid residue except C, 
D, G, H, or P is acceptable; at the next three (E, F, 
G) amino acid positions, any amino acid residue except 
C, F, or P is acceptable". This motif is designed to 
search for five consecutive heptad repeats (thus the 
repeat of the first line five times), meaning that it 
searches for 35-mer sized peptides. It may also be 
designed to search for 28-mers, by only repeating the 
initial motif four times. With respect to the 
ALLMOTI5 motif, a 35-mer search is preferred. Those 
viral (non-bacteriophage) sequences identified via 
such an ALLMOTI5 motif are listed in Table V in U.S. 
Patent Application No. 08/470,896 filed on June 6, 
1995, which is incorporated herein by reference in its 
entirety. These viral sequences potentially exhibit 
antiviral activity, may be useful in the 
identification of antiviral compounds, and are 
intended to be within the scope of the invention. In 
those instances wherein a single gene exhibits more 
than one sequence recognized by the ALLMOTI5 search 
motif, the amino acid residue numbers of these 
sequences are listed under "Area 2", "Area 3", etc. 
This convention is used for each of the Tables listed, 
below, at the end of this Section. 
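
As a rough illustration of how an ALLMOTI5-style search could be run over a protein sequence, the Python sketch below scans for every window of consecutive heptads that satisfies the motif, including overlapping windows. It is a sketch under stated assumptions, not the PC/Gene procedure: the function name `find_windows` and the use of a lookahead regular expression are choices made here for illustration.

```python
import re

# One ALLMOTI5 heptad as a regular expression:
# positions A and D exclude C, D, G, H, P; positions B, C, E, F, G exclude C, F, P.
HEPTAD = "[^CDGHP][^CFP]{2}[^CDGHP][^CFP]{3}"


def find_windows(sequence, repeats=5):
    """Return 1-based (start, end) residue numbers of every matching window.

    A zero-width lookahead is used so that overlapping windows are all reported.
    `repeats=5` gives the preferred 35-mer search; `repeats=4` gives a 28-mer search.
    """
    pattern = re.compile("(?=(?:%s){%d})" % (HEPTAD, repeats))
    width = 7 * repeats
    return [(m.start() + 1, m.start() + width) for m in pattern.finditer(sequence)]


if __name__ == "__main__":
    # Toy sequence (not a real protein): a proline at the end blocks the last window.
    toy = "A" * 34 + "P"
    print(find_windows(toy, repeats=5))
    print(find_windows(toy, repeats=4))
```

Adjacent or overlapping windows found this way could then be merged into the residue ranges reported as separate "Areas" for a gene, although the merging rule is not specified in this Section.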
The 107x178x4 motif is written as follows: 

[EFIKLNQSTVWY]-{CFMP}(2)-[EFIKLNQSTVWY]-{CFMP}(3)- 
[EFIKLNQSTVWY]-{CFMP}(2)-[EFIKLNQSTVWY]-{CFMP}(3)- 
[EFIKLNQSTVWY]-{CFMP}(2)-[EFIKLNQSTVWY]-{CFMP}(3)- 
[EFIKLNQSTVWY]-{CFMP}(2)-[EFIKLNQSTVWY]-{CFMP}(3)- 

Translating this motif, it would read: "at the 
first (A) position of the heptad, only amino acid 
residue E, F, I, K, L, N, Q, S, T, V, W, or Y is 
acceptable; at the next two (B, C) amino acid 
positions, any amino acid residue except C, F, M or P 
is acceptable; at the fourth position (D), only amino 
acid residue E, F, I, K, L, N, Q, S, T, V, W, or Y is 
acceptable; at the next three (E, F, G) amino acid 
positions, any amino acid residue except C, F, M or P 
is acceptable". This motif is designed to search for 
four consecutive heptad repeats (thus the repeat of 
the first line four times), meaning that it searches 
for 28-mer sized peptides. It may also be designed to 
search for 35-mers, by repeating the initial motif 
five times. With respect to the 107x178x4 motif, a 
28-mer search is preferred. 
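
To make the position-by-position reading of the 107x178x4 motif concrete, the following Python sketch checks a candidate peptide against the motif and reports the first residue that violates it. The function name `first_violation` is hypothetical, and the test peptides are artificial strings chosen only to exercise the rule.

```python
HEPTAD_POSITIONS = "ABCDEFG"
CORE_ALLOWED = set("EFIKLNQSTVWY")  # only these residues are acceptable at A and D
OTHER_EXCLUDED = set("CFMP")        # these residues are unacceptable at B, C, E, F, G


def first_violation(peptide, repeats=4):
    """Check the first 7*repeats residues against the 107x178x4 motif.

    Returns None if the window satisfies the motif, otherwise a tuple
    (1-based residue number, heptad position letter, offending residue).
    """
    width = 7 * repeats
    if len(peptide) < width:
        raise ValueError(f"peptide must contain at least {width} residues")
    for i, residue in enumerate(peptide[:width]):
        position = HEPTAD_POSITIONS[i % 7]
        if position in "AD":
            ok = residue in CORE_ALLOWED
        else:
            ok = residue not in OTHER_EXCLUDED
        if not ok:
            return (i + 1, position, residue)
    return None


if __name__ == "__main__":
    print(first_violation("LAALAAA" * 4))                # satisfies the motif
    print(first_violation("LAALAAA" * 3 + "LAAPAAA"))    # proline at a D position
```

Passing `repeats=5` applies the same rule to a 35-mer window, corresponding to the alternative search mentioned above.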

Those viral (non-bacteriophage) sequences 
identified via such a 107x178x4 motif are listed in 
Table VI in U.S. Patent Application No. 08/470,896 
filed on June 6, 1995, which is incorporated herein, 
by reference, in its entirety. Those viral 
(non-bacteriophage) sequences listed in Table VII of 
U.S. Patent Application No. 08/470,896 (incorporated 
herein by reference in its entirety) are particularly 
preferred. 

The 107x178x4 search motif was also utilized to 
identify non-viral procaryotic protein sequences, as 
listed in Table VIII in U.S. Patent Application No. 
08/470,896 filed on June 6, 1995, which is 
incorporated herein, by reference, in its entirety. 
Further, this search motif was used to reveal a number 
of human proteins. The results of this human protein 
107x178x4 search are listed in Table IX in U.S. Patent 
Application No. 08/470,896 filed on June 6, 1995, 
which is incorporated herein, by reference, in its 
entirety. The sequences listed in Tables VIII and IX, 
therefore, reveal peptides which may be useful as 
antifusogenic compounds or in the identification of 
antifusogenic compounds, and are intended to be within 
the scope of the invention. 

The PLZIP series of motifs are as listed in FIG. 
19. These motifs are designed to identify leucine 
zipper coiled-coil like heptads wherein at least one 
proline residue is present at some predefined distance 
N-terminal to the repeat. These PLZIP motifs find 
regions of proteins with similarities to HIV-1 DP178 
generally located just N-terminal to the transmembrane 
anchor. These motifs may be translated according to 
the same convention described above. Each line 
depicted in FIG. 19 represents a single, complete 
search motif. "X" in these motifs refers to any amino 
acid residue. In instances wherein a motif contains 
two numbers within parentheses, this refers to a 
variable number of amino acid residues. For example, 
X(1,12) is translated to "the next one to twelve 
amino acid residues, inclusive, may be any amino 
acid". 
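
The variable-length element X(m,n) corresponds to a bounded repeat in a regular expression. The Python sketch below translates such an element and uses it in a toy pattern; the function name `gap_to_regex` is hypothetical, and the toy pattern (a proline, a gap of one to twelve residues, then a leucine) is purely illustrative and is not one of the PLZIP motifs of FIG. 19.

```python
import re


def gap_to_regex(token):
    """Translate 'X(m,n)' into a regex matching m to n residues of any kind."""
    match = re.fullmatch(r"X\s*\(\s*(\d+)\s*,\s*(\d+)\s*\)", token)
    if match is None:
        raise ValueError(f"not a variable-length element: {token!r}")
    low, high = int(match.group(1)), int(match.group(2))
    if low > high:
        raise ValueError("lower bound exceeds upper bound")
    # [A-Z] stands for any one-letter residue code in an upper-case sequence.
    return "[A-Z]{%d,%d}" % (low, high)


if __name__ == "__main__":
    # Toy pattern for illustration only (not taken from FIG. 19).
    toy_pattern = re.compile("P" + gap_to_regex("X(1,12)") + "L")
    for candidate in ("PAAAL", "PL", "P" + "A" * 13 + "L"):
        print(candidate, bool(toy_pattern.fullmatch(candidate)))
```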

Tables X through XIV in U.S. Patent Application 
No. 08/470,896 filed on June 6, 1995 (which is 
incorporated herein, by reference, in its entirety), 
list sequences identified via searches conducted with 
such PLZIP motifs. Specifically, Table X lists viral 
sequences identified via PCTLZIP, P1CTLZIP and 
P2CTLZIP search motifs, Table XI lists viral sequences 
identified via P3CTLZIP, P4CTLZIP, P5CTLZIP and 
P6CTLZIP search motifs, Table XII lists viral sequences 
identified via P7CTLZIP, P8CTLZIP and P9CTLZIP search 
motifs, Table XIII lists viral sequences identified 
via P12LZIPC searches and Table XIV lists viral 
sequences identified via P23TLZIPC search motifs. The 
viral sequences listed in these tables represent 
peptides which potentially exhibit antiviral activity, 
may be useful in the identification of antiviral 
compounds, and are intended to be within the scope of 
the invention. 

The Examples presented in Sections 17, 18, 26 and 
27 below, demonstrate that viral sequences identified 
via the motif searches described herein exhibit 
substantial antiviral characteristics. Specifically, 
the Example presented in Section 17 describes peptides 
with anti-respiratory syncytial virus activity, the 
Example presented in Section 18 describes peptides 
with anti-parainfluenza virus activity, the Example 
presented in Section 26 describes peptides with 
anti-measles virus activity and the Example presented 
in Section 27 describes peptides with anti-simian 
immunodeficiency virus activity. 

The DP107 and DP178 analogs may, further, contain 
any of the additional groups described for DP178, 
above, in Section 5.1. For example, these peptides 
may include any of the additional amino-terminal 
groups as described above for "X" groups, and may also 
include any of the carboxy-terminal groups as 
described, above, for "Z" groups. 

Additionally, truncations of the identified DP107 
and DP178 peptides are among the peptides of the 
invention. Further, such DP107 and DP178 analogs and 
DP107/DP178 analog truncations may exhibit one or more 
amino acid substitutions, insertions, and/or deletions. 
The DP178 analog amino acid substitutions, insertions 
and deletions are as described, above, for DP178-like 
peptides in Section 5.1. The DP107 analog amino acid 
substitutions, insertions and deletions are also as 
described, above, for DP107-like peptides in Section 
5.2. Representative examples of such DP107/DP178 
truncations are provided in Tables XV through XXII of 
U.S. Patent Application Serial No. 08/470,896 filed on 
June 6, 1995, which is incorporated herein by 
reference in its entirety. 

Other exemplary DP178 and DP107 peptides and 
DP178-like and DP107-like peptides which are 
considered part of the present invention include the 
peptides described in U.S. Patent Application Serial 
No. 09/315,304 filed on May 4, 1999, which is 
incorporated by reference in its entirety. Such DP178 
and DP107 peptides and DP178-like and DP107-like 
peptides include, e.g., the peptides listed below in 
Table V. 

Other DP178, DP107, DP178-like and DP107-like 
peptides include peptides described, e.g., in U.S. 
Patent Application Serial No. 08/038,387 filed on 
March 29, 1993, now U.S. Patent No. 5,627,023; in 
U.S. Patent Application Serial No. 08/073,028 filed on 
June 7, 1993, now U.S. Patent No. 5,464,933; in U.S. 
Patent Application Serial No. 08/255,208 filed on 
June 7, 1994; in U.S. Patent Application Serial No. 
08/360,107 filed on December 20, 1994; and in U.S. 
Patent Application Serial No. 08/470,896 filed on June 
6, 1995, each of which is incorporated herein by 
reference in its entirety. 
TABLE V 

T No.  Sequence 

1  GIKQLQARILAVERYLKDQ 
2  NNLLRAIEAQQHLLQLTVW 
3  NEQELLELDKWASLWNWF 
4  YTSLIHSLIEESQNQQEK 
5  Ac-VWGIKQLQARILAVERYLKDQQLLGIWG-NH2 
6  QHLLQLTVWGIKQLQARILAVERYLKDQ 
7  LRAIEAQQHLLQLTVWGIKQLQARILAV 
8  VQQQNNLLARIEAQQHLLQLTVWGIKQL 
9  RQLLSGIVQQQNNLLRAIEAQQHLLQLT 
10  MTLTVQARQLLSGIVQQQNNLLRAIEAQ 
12  WSLSNGVSVLTSKVLDLKNYIDKQLL 
13  LLSTNKAWSLSNGVSVLTSKVLDLKNY 
15  Ac-VLHLEGEVNKIKSALLSTNKAWSLSNG-NH2 
19  Ac-LLSTNKAWSLSNGVSVLTSKVLDLKNY-NH2 
20  Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 
21  Ac-NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2 
22  Ac-IELSNIKENKCNGTDAKVKLIKQELDKYKNAVTELQLLMQST-NH2 
23  Ac-IELSNIKENKCNGTDAKVKLIKQELDKY-NH2 
24  Ac-ENKCNGTDAKVKLIKQELDKYKNAVTEL-NH2 
25  Ac-DAKVKLIKQELDKYKNAVTELQLLMQST-NH2 
26  Ac-CNGTDAKVKLIKQELDKYKNAVTELQLL-NH2 
27  Ac-SNIKENKCNGTDAKVKLIKQELDKYKNAVTELQLL-NH2 
28  Ac-ASGVAVSKVLHLEGEVNKIKSALLSTNKAWSLSNGV-NH2 
29  Ac-SGVAVSKVLHLEGEVNKIKSALLSTNKAWSLSNG-NH2 
30  Ac-VLHLEGEVNKIKSALLSTHKAWSLSNGVSVLTSK-NH2 
31  Ac-ARKLQRMKQLEDKVEELLSKNYHYLENEVARLKKLV-NH2 
32  Ac-RMKQLEDKVEELLSKNYHYLENEVARLKKLVGER-NH2 
33  Ac-VQQQNNLLRAIEAQQHLLQLTVWGIKQL-NH2 
34  Ac-LRAIEAQQHLLQLTVWGIKQLQARILAV-NH2 
35  Ac-QHLLQLTVWGIKQLQARILAVERYLKDQ-NH2 
36  Ac-RQLLSGIVQQQNNLLRAIEAQQHLLQLT-NH2 
37  Ac-MTLTVQARQLLSGIVQQQNNLLRAIEAQ-NH2 
38  Ac-AKQARSDIEKLKEAIRDTNKAVQSVQSS-NH2 
39  Ac-AAVALVEAKQARSDIEKLKEAIRDTNKAVQSVQSS-NH2 
40  Ac-AKQARSDIEKLKEAIRDTNKAVQSVQSSIGNLIVA-NH2 
41  Ac-GTIALGVATSAQITAAVALVEAKQARSD-NH2 
42  Ac-ATSAQITAAVALVEAKQARSDIEKLKEA-NH2 
43  Ac-AAVALVEAKQARSDIEKLKEAIRDTNKA-NH2 
44  Ac-IEKLKEAIRDTNKAVQSVQSSIGNLIVA-NH2 
45  Ac-IRDTNKAVQSVQSSIGNLIVAIKSVQDY-NH2 
46  Ac-AVQSVQSSIGNLIVAIKSVQDYVNKEIV-NH2 
47  Ac-QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLARILAVERYLKDQ-NH2 
48  Ac-QARQLLSGIVQQQNNLLRAIEAQQHLLQ-NH2 
49  Ac-MTWMEMDREINNYTSLIGSLIEESQNQQEKNEQELLELDKWASLWNWP-NH2 
50  Ac-WMEWDREINNYTSLIGSLIEESQNQQEKNEQELLE-NH2 
51  Ac-INNYTSLIGSLIEESQNQQEKNEQELLE-NH2 
52  Ac-INNYTSLIGSLIEESQNQQEKNEQELLELDKWASL-NH2 
53  Ac-EWDREINNYTSLIGSLIEESQNQQEKNEQEGGC-NH2 
54  Ac-QSRTLLAGIVQQQQQLLDWKRQQELLR-NH2 
55  Ac-NNDTWQEWERKVDFLEENITALLEEAQIQQEKNMYELQKLNSWD-NH2 
56  Ac-WQEWERKVDFLEENITALLEEAQIQQEK-NH2 
57  Ac-VDFLEENITALLEEAQIQQEKNMYELQK-NH2 
58  Ac-ITALLEEAQIQQEKNMYELQKLNSWDVF-NH2 
59  Ac-SSESFTLLEQWNNWKLQLAEQWLEQINEKHYLEDIS-NH2 
60  Ac-DKWASLWNWF-NH2 
61  Ac-NEQELLELDKWASLWNWF-NH2 
62  Ac-EKNEQELLELDKWASLWNWF-NH2 
63  Ac-NQQEKNEQELLELDKWASLWNWF-NH2 
64  Ac-ESQNQQEKNEQELLELDKWASLWNWF-NH2 
65  Ac-LIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 
66  Ac-NDQKKLMSNNVQIVRQQSYSIMSIIKEE-NH2 
67  Ac-DEFDASISQVNEKINQSLAFIRKSDELL-NH2 
68  Ac-VSKGYSALRTGWYTSVITIELSNIKEN-NH2 
69  Ac-WSLSNGVSVLTSKVLDLKNYIDKQLL-NH2 
70  Ac-VNKIKSALLSTNKAWSLSNGVSVLTSK-NH2 
71  Ac-PIINFYDPLVFPSDEFDASISQVNEKINQSLAFIR-NH2 
72  Ac-NLVYAQLQFTYDTLRGYINRALAQIAEA-NH2 
73  Ac-LNQVDLTETLERYQQRLNTYALVSKDASYRS-NH2 
74  Ac-ELLVLKKAQLNRHSYLKDSDFLDAALD-NH2 
75  Ac-LAEAGEESVTEDTEREDTEEEREDEEE-NH2 
76  Ac-ALLAEAGEESVTEDTEREDTEEEREDEEEENEART-NH2 
77  Ac-ETERSVDLVAALLAEAGEESVTEDTEREDTEEERE-NH2 
78  Ac-EESVTEDTEREDTEEEREDEEEENEART-NH2 
79  Ac-VDLVAALLAEAGEESVTEDTEREDTEEE-NH2 
80  Ac-NSETERSVDLVAALLAEAGEESVTE-NH2 
81  Ac-DISYAQLQFTYDVLKDYINDALRNIMDA-NH2 
82  Ac-SNVFSKDEIMREYNSQKQHIRTLSAKVNDN-NH2 
83  Biotin-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 
84  Dig-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 
85  Biotin-NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2 
86  Dig-NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2 
87  Ac-VLHQLNIQLKQYLETQERLLAGNRIAARQLLQIWKDVA-NH2 
88  Ac-LWHEQLLNTAQRAGLQLQLINQALAVREKVLIRYDIQK-NH2 
89  Ac-LLDNFESTWEQSKELWEQQEISIQNLHKSALQEYW-NH2 
90  Ac-LSNLLQISNNSDEWLEALEIEHEKWKLTQWQSYEQF-NH2 
91  Ac-KLEALEGKLEALEGKLEALEGKLEALEGKLEALEGK-NH2 
92  Ac-ELRALRGELRALRGELRALRGELRALRGK-NH2 
93  Ac-ELKAKELEGEGLAEGEEALKGLLEKAAKLEGLELLK-NH2 
94  Ac-WEAAAREAAAREAAAREAAARA-NH2 
95  Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNAF-NH2 
96  Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLANWF-NH2 
97  Ac-YTSLIHSLIEESQNQQEKNQQELLELDKWASLWNWF-NH2 
98  Ac-YTSLIHSLIEESQNQQEKNEQBLLQLDKWASLWNWF-NH2 
99  Ac-YTSLIHSLIEESQNQQEKNQQELLQLDKWASLWNWF-NH2 
100  Ac-RMKQLEDKVEELLSKNYHLENEVARLKKLVGER-NH2 
101  Ac-QQLLQLTVWGIKQLQARILAVERYLKNQ-NH2 
102  Ac-NEQELLELDKWASLWNWF-NH2 
103  Ac-YTSLIQSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 
104  Ac-IINFYDPLVFPSDEFDASISQVNEKINQSLAFIRK-NH2 
105  Ac-INFYDPLVFPSDEFDASISQVNEKINQSLAFIRKS-NH2 
106  Ac-NFYDPLVFPSDEFDASISQVNEKINQSLAFIRKSD-NH2 
107  Ac-FYDPLVFPSDEFDASISQVNEKINQSLAFIRKSDE-NH2 
108  Ac-YDPLVFPSDEFDASISQVNEKINQSLAFIRKSDEL-NH2 
109  Ac-DPLVFPSDEFDASISQVNEKINQSLAFIRKSDELL-NH2 
110  Ac-PLVFPSDEFDASISQVNEKINQSLAFIRKSDELLH-NH2 
111  Ac-LVFPSDEFDASISQVNEKINQSLAFIRKSDELLHN-NH2 
112  Ac-VFPSDEFDASISQVNEKINQSLAFIRKSDELLHNV-NH2 
113  Ac-FPSDEFDASISQVNEKINQSLAFIRKSDELLHNVN-NH2 
114  Ac-PSDEFDASISQVNEKINQSLAFIRKSDELLHNVNA-NH2 
115  Ac-SDEFDASISQVNEKINQSLAFIRKSDELLHNVNAG-NH2 
116  Ac-DEFDASISQVNEKINQSLAFIRKSDELLHNVNAGK-NH2 
117  Ac-EFDASISQVNEKINQSLAFIRKSDELLHNVNAGKS-NH2 
118  Ac-FDASISQVNEKINQSLAFIRKSDELLHNVNAGKST-NH2 
119  Ac-DASISQVNEKINQSLAFIRKSDELLHNVNAGKSTT-NH2 
120  Ac-ASGVAVSKVLHLEGEVNKIKSALLSTNKAWSLSN-NH2 
121  Ac-SGVAVSKVLHLEGEVNKIKSALLSTNKAWSLSNG-NH2 
122  Ac-GVAVSKVLHLEGEVNKIKSALLSTNKAWSLSNGV-NH2 
123  Ac-VAVSKVLHLEGEVNKIKSALLSTNKAWSLSNGVS-NH2 
124  Ac-AVSKVLHLEGEVNKIKSALLSTNKAWSLSNGVSV-NH2 
125  Ac-VSKVLHLEGEVNKIKSALLSTNKAWSLSNGVSVL-NH2 
126  Ac-SKVLHLEGEVNKIKSALLSTNKAWSLSNGVSVLT-NH2 
127  Ac-KVLHLEGEVNKIKSALLSTNKAWSLSNGVSVLTS-NH2 
128  Ac-VLHLEGEVNKIKSALLSTNKAWSLSNGVSVLTSK-NH2 
129  Ac-LHLEGEVNKIKSALLSTNKAWSLSNGVSVLTSKV-NH2 
130  Ac-HLEGEVNKIKSALLSTNKAWSLSNGVSVLTSKVL-NH2 
131  Ac-LEGEVNKIKSALLSTNKAWSLSNGVSVLTSKVLD-NH2 
132  Ac-EGEVNKIKSALLSTNKAWSLSNGVSVLTSKVLDL-NH2 
133  Ac-GEVNKIKSALLSTNKAWSLSNGVSVLTSKVLDLK-NH2 
134  Ac-EVNKIKSALLSTNKAWSLSNGVSVLTSKVLDLKN-NH2 
135  Ac-VNKIKSALLSTNKAWSLSNGVSVLTSKVLDLKNY-NH2 
136  Ac-NKIKSALLSTNKAWSLSNGVSVLTSKVLDLKNYI-NH2 
137  Ac-KIKSALLSTNKAWSLSNGVSVLTSKVLDLKNYID-NH2 
138  Ac-IKSALLSTNKAWSLSNGVSVLTSKVLDLKNYIDK-NH2 
139  Ac-KSALLSTNKAWSLSNGVSVLTSKVLDLKNYIDKQ-NH2 
140  Ac-SALLSTNKAWSLSNGVSVLTSKVLDLKNYIDKQL-NH2 
141  Ac-ALLSTNKAWSLSNGVSVLTSKVLDLKNYIDKQLL-NH2 
142  Ac-YTSVITIELSNIKENKCNGTDAKVKLIKQELDKYK-NH2 
143  Ac-TSVITIELSNIKENKCNGTDAKVKLIKQELDKYKN-NH2 
144  Ac-SVITIELSNIKENKCNGTDAKVKLIKQELDKYKNA-NH2 
145  Ac-VITIELSNIKENKCNGTDAKVKLIKQELDKYKNAV-NH2 
146  Ac-ITIELSNIKENKCNGTDAKVKLIKQELDKYKNAVT-NH2 
147  Ac-TIELSNIKENKCNGTDAKVKLIKQELDKYKNAVTE-NH2 
148  Ac-IELSNIKENKCNGTDAKVKLIKQELDKYKNAVTEL-NH2 
149  Ac-ELSNIKENKCNGTDAKVKLIKQELDKYKNAVTELQ-NH2 
150  Ac-LSNIKENKCNGTDAKVKLIKQELDKYKNAVTELQL-NH2 
151  Ac-SNIKENKCNGTDAKVKLIKQELDKYKNAVTELQLL-NH2 
152  Ac-NIKENKCNGTDAKVKLIKQELDKYKNAVTELQLLM-NH2 
153  Ac-IKENKCNGTDAKVKLIKQELDKYKNAVTELQLLMQ-NH2 
154  Ac-KENKCNGTDAKVKLIKQELDKYKNAVTELQLLMQS-NH2 
155  Ac-ENKCNGTDAKVKLIKQELDKYKNAVTELQLLMQST-NH2 
156  Ac-LLDNFESTWEQSKELWELQEISIQNLHKSALQEYWN-NH2 
157  Ac-ALGVATSAQITAAVALVEAKQARSDIEKLKEAIRD-NH2 
158  Ac-LGVATSAQITAAVALVEAKQARSDIEKLKEAIRDT-NH2 
159  Ac-GVATSAQITAAVALVEAKQARSDIEKLKEAIRDTN-NH2 
160  Ac-VATSAQITAAVALVEAKQARSDIEKLKEAIRDTNK-NH2 
161  Ac-ATSAQITAAVALVEAKQARSDIEKLKEAIRDTNKA-NH2 
162  Ac-TSAQITAAVALVEAKQARSDIEKLKEAIRDTNKAV-NH2 
163  Ac-SAQITAAVALVEAKQARSDIEKLKEAIRDTNKAVQ-NH2 
164  Ac-AQITAAVALVEAKQARSDIEKLKEAIRDTNKAVQS-NH2 
165  Ac-QITAAVALVEAKQARSDIEKLKEAIRDTNKAVQSV-NH2 
166  Ac-ITAAVALVEAKQARSDIEKLKEAIRDTNKAVQSVQ-NH2 
167  Ac-TAAVALVEAKQARSDIEKLKEAIRDTNKAVQSVQS-NH2 
168  Ac-AAVALVEAKQARSDIEKLKEAIRDTNKAVQSVQSS-NH2 
169  Ac-AVALVEAKQARSDIEKLKEAIRDTNKAVQSVQSSI-NH2 
170  Ac-VALVEAKQARSDIEKLKEAIRDTNKAVQSVQSSIG-NH2 
171  Ac-ALVEAKQARSDIEKLKEAIRDTNKAVQSVQSSIGN-NH2 
172  Ac-LVEAKQARSDIEKLKEAIRDTNKAVQSVQSSIGNL-NH2 
173  Ac-VEAKQARSDIEKLKEAIRDTNKAVQSVQSSIGNLI-NH2 
174  Ac-VEAKQARSDIEKLKEAIRDTNKAVQSVQSSIGNLIV-NH2 
175  Ac-KQARSDIEKLKEAIRDTNKAVQSVQSSIGNLIVAI-NH2 
176  Ac-QARSDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIK-NH2 
177  Ac-ARSDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIKS-NH2 
178  Ac-RSDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIKSV-NH2 
179  Ac-SDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQ-NH2 
180  Ac-DIEKLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQD-NH2 
181  Ac-IEKLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQDY-NH2 
182  Ac-EKLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQDYV-NH2 
183  Ac-KLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQDYVN-NH2 
184  Ac-LKEAIRDTNKAVQSVQSSIGNLIVAIKSVQDYVNK-NH2 
185  Ac-KEAIRDTNKAVQSVQSSIGNLIVAIKSVQDYVNKE-NH2 
186  Ac-EAIRDTNKAVQSVQSSIGNLIVAIKSVQDYVNKEI-NH2 
187  Ac-AIRDTNKAVQSVQSSIGNLIVAIKSVQDYVNKEIV-NH2 
188  Ac-IRDTNKAVQSVQSSIGNLIVAIKSVQDYVNKEIV-NH2 
189  Ac-YTPNDITLNNSVALDPIDISIELNKAKSDLEESKE-NH2 
190  Ac-TPNDITLNNSVALDPIDISIELNKAKSDLEESKEW-NH2 
191  Ac-PNDITLNNSVALDPIDISIELNKAKSDLEESKEWI-NH2 
192  Ac-NDITLNNSVALDPIDISIELNKAKSDLEESKEWIR-NH2 
193  Ac-DITLNNSVALDPIDISIELNKAKSDLEESKEWIRR-NH2 
194  Ac-ITLNNSVALDPIDISIELNKAKSDLEESKEWIRRS-NH2 
195  Ac-TLNNSVALDPIDISIELNKAKSDLEESKEWIRRSN-NH2 
196  Ac-LNNSVALDPIDISIELNKAKSDLEESKEWIRRSNQ-NH2 
197  Ac-NNSVALDPIDISIELNKAKSDLEESKEWIRRSNQK-NH2 
198  Ac-NSVALDPIDISIELNKAKSDLEESKEWIRRSNQKL-NH2 
200  Ac-SVALDPIDISIELNKAKSDLEESKEWIRRSNQKLD-NH2 
201  Ac-VALDPIDISIELNKAKSDLEESKEWIRRSNQKLDS-NH2 
202  Ac-ALDPIDISIELNKAKSDLEESKEWIRRSNQKLDSI-NH2 
203  Ac-LDPIDISIELNKAKSDLEESKEWIRRSNQKLDSIG-NH2 
204  Ac-DPIDISIELNKAKSDLEESKEWIRRSNQKLDSIGN-NH2 
205  Ac-PIDISIELNKAKSDLEESKEWIRRSNQKLDSIGNW-NH2 
206  Ac-IDISIELNKAKSDLEESKEWIRRSNQKLDSIGNWH-NH2 
207  Ac-DISIELNKAKSDLEESKEWIRRSNQKLDSIGNWHQ-NH2 
208  Ac-ISIELNKAKSDLEESKEWIRRSNQKLDSIGNWHQS-NH2 
209  Ac-SIELNKAKSDLEESKEWIRRSNQKLDSIGNWHQSS-NH2 
210  Ac-IELNKAKSDLEESKEWIRRSNQKLDSIGNWHQSST-NH2 
211  Ac-ELNKAKSDLEESKEWIRRSNQKLDSIGNWHQSSTT-NH2 
212  Ac-ELRALRGELRALRGELRALRGELRALRGELRALRGK-NH2 
213  Ac-YTSLIHSLIEESQNQQQKNEQELLELDKWASLWNWF-NH2 
214  Ac-YTSLIHSLIEESQNQQEKNEQELLELNKWASLWNWF-NH2 
215  Ac-YTSLIHSLIEQSQNQQEKNEQELLELDKWASLWNWF-NH2 
216  Ac-YTSLIHSLIQESQNQQEKNEQELLELDKWASLWNWF-NH2 
217  Ac-YTSLIHSLIQQSQNQQQKNQQQLLQLNKWASLWNWF-NH2 
218  Ac-EQELLELDKWASLWNWF-NH2 
219  Ac-QELLELDKWASLWNWF-NH2 
220  Ac-ELLELDKWASLWNWF-NH2 
221  Ac-LELDKWASLWNWF-NH2 
222  Ac-ELDKWASLWNWF-NH2 
226  Ac-WASLWNWF-NH2 
227  Ac-ASLWNWF-NH2 
229  Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLANAA-NH2 
230  Ac-YTSLIHSLIEESQNQQEKNEQQLLELDKWASLWNWF-NH2 
231  Ac-YTSLIQSLIEESQNQQEKNQQELLELDKWASLWNWF-NH2 
234  Ac-EAAAREAAAREAAARLELDKWASLWNWF-NH2 
236  Ac-PSLRDPISAEISIQALSYALGGDINKVLEKLGYSG-NH2 
237  Ac-SLRDPISAEISIQALSYALGGDINKVLEKLGYSGG-NH2 
238  Ac-LRDPISAEISIQALSYALGGDINKVLEKLGYSGGD-NH2 
239  Ac-RDPISAEISIQALSYALGGDINKVLEKLGYSGGDL-NH2 
240  Ac-DPISAEISIQALSYALGGDINKVLEKLGYSGGDLL-NH2 
241  Ac-PISAEISIQALSYALGGDINKVLEKLGYSGGDLLG-NH2 
242  Ac-ISAEISIQALSYALGGDINKVLEKLGYSGGDLLGI-NH2 
243  Ac-SAEISIQALSYALGGDINKVLEKLGYSGGDLLGIL-NH2 
244  Ac-AEISIQALSYALGGDINKVLEKLGYSGGDLLGILE-NH2 
245  Ac-EISIQALSYALGGDINKVLEKLGYSGGDLLGILES-NH2 
246  Ac-ISIQALSYALGGDINKVLEKLGYSGGDLLGILESR-NH2 
247  Ac-SIQALSYALGGDINKVLEKLGYSGGDLLGILESRG-NH2 
248  Ac-IQALSYALGGDINKVLEKLGYSGGDLLGILESRGI-NH2 
249  Ac-QALSYALGGDINKVLEKLGYSGGDLLGILESRGIK-NH2 
250  Ac-ALSYALGGDINKVLEKLGYSGGDLLGILESRGIKA-NH2 
251  Ac-LSYALGGDINKVLEKLGYSGGDLLGILESRGIKAR-NH2 
252  Ac-PDAVYLHRIDLGPPISLERLDVGTNLGNAIAKLED-NH2 
253  Ac-DAVYLHRIDLGPPISLERLDVGTNLGNAIAKLEDA-NH2 
254  Ac-AVYLHRIDLGPPISLERLDVGTNLGNAIAKLEDAK-NH2 
255  Ac-VYLHRIDLGPPISLERLDVGTNLGNAIAKLEDAKE-NH2 
256  Ac-YLHRIDLGPPISLERLDVGTNLGNAIAKLEDAKEL-NH2 
257  Ac-LHRIDLGPPISLERLDVGTNLGNAIAKLEDAKELL-NH2 
258  Ac-HRIDLGPPISLERLDVGTNLGNAIAKLEDAKELLE-NH2 
259  Ac-RIDLGPPISLERLDVGTNLGNAIAKLEDAKELLES-NH2 
260  Ac-IDLGPPISLERLDVGTNLGNAIAKLEDAKELLESS-NH2 
261  Ac-DLGPPISLERLDVGTNLGNAIAKLEDAKELLESSD-NH2 
262  Ac-LGPPISLERLDVGTNLGNAIAKLEDAKELLESSDQ-NH2 
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263 AC-GPPISLERLDVGTNLGNAIAXLEDAKELLESSDQI-NH2 

2 64 AC - PPISLERLDVGTNLGNAIAKLEDAKELLESSDQIL-NH2 

265 Ac - P IS LERLDVGTNLGNAI AKLEDAKELLES SDQ ILR -NH 2 

2 66 Ac - ISLERLDVGTNLGNAIAKLEDAKELLESSDQIRS -NH2 

267 AC - SLERLDVGTNLGNAIAKLEDAKELLES SDQ ILRSM -NH2 

268 AC-LERLDVGTNLGNAIAKLEDAKELLESSDQILRSMK-NH2 
5 269 AC-EWIRRSNQKLDSI-NH2 

270 AC - LELDKWASLANAF -NH2 

271 AC - LELDKWASLFNFF -NH2 

272 AC - LELDKWASLANWF -NH2 

273 AC - LELDKWASLWNAF -NH2 

2 74 Ac -ELGNVNNSISNALDKLEESNSKLDKVNVKLTSTSA-NH2 

2 75 AC - TELGNVNNS I SNALDKLEESNSKLDKVNVKLTSTS - NH2 

276 AC - STE LGNVNNS I SNALDKLEESNS KLDKVNVKLTST - NH 2 

10 277 AC - ISTELGNVNNS I SNALDKLEESNS KLDKVNVKLTS -NH2 

2 78 Ac -DISTELGNVNNSISNALDKLEESNSKLDKVNVKLT-NH2 

279 AC-LDISTELGNVNNSISNALDKLEESNSKLDKVNVKL-NH2 

2 8 0 AC -NLD I STELGNVNNS I SNALDKLEESNS KLDKVNVK- NH 2 

2 81 AC-GNLDISTELGNVNNSISNALDKLEESNSKLDKVNV-NH2 

2 82 . AC-TGNLDISTELGNVNNSISNALDKLEESNSKLDKVN-NH2 

2 83 AC-VTGNLDISTELGNVNNSISNALDKLEESNSKLDKV-NH2 

284 Ac-IVTGNLDISTELGNVNNSISNALDKLEESNSKLDK-NH2

285 Ac-VIVTGNLDISTELGNVNNSISNALDKLEESNSKLD-NH2

286 Ac-QVIVTGNLDISTELGNVNNSISNALDKLEESNSKL-NH2

287 Ac-SQVIVTGNLDISTELGNVNNSISNALDKLEESNSK-NH2

288 Ac-DSQVIVTGNLDISTELGNVNNSISNALDKLEESNS-NH2

289 Ac-LDSQVIVTGNLDISTELGNVNNSISNALDKLEESN-NH2

290 Ac-ILDSQVIVTGNLDISTELGNVNNSISNALDKLEES-NH2

291 Ac-SILDSQVIVTGNLDISTELGNVNNSISNALDKLEE-NH2

292 Ac-ISILDSQVIVTGNLDISTELGNVNNSISNALDKLE-NH2

293 Ac-NISILDSQVIVTGNLDISTELGNVNNSISNALDKL-NH2

294 Ac-KNISILDSQVIVTGNLDISTELGNVNNSISNALDK-NH2

295 Ac-QKNISILDSQVIVTGNLDISTELGNVNNSISNALD-NH2

296 Ac-YQKNISILDSQVIVTGNLDISTELGNVNNSISNAL-NH2

297 Ac-TYQKNISILDSQVIVTGNLDISTELGNVNNSISNA-NH2

298 Ac-ATYQKNISILDSQVIVTGNLDISTELGNVNNSISN-NH2

299 Ac-DATYQKNISILDSQVIVTGNLDISTELGNVNNSIS-NH2

300 Ac-FDATYQKNISILDSQVIVTGNLDISTELGNVNNSI-NH2

301 Ac-EFDATYQKNISILDSQVIVTGNLDISTELGNVNNS-NH2

302 Ac-GEFDATYQKNISILDSQVIVTGNLDISTELGNVNN-NH2

303 Ac-SGEFDATYQKNISILDSQVIVTGNLDISTELGNVN-NH2

304 Ac-LSGEFDATYQKNISILDSQVIVTGNLDISTELGNV-NH2

305 Ac-RLSGEFDATYQKNISILDSQVIVTGNLDISTELGN-NH2

306 Ac-LRLSGEFDATYQKNISILDSQVIVTGNLDISTELG-NH2

307 Ac-TLRLSGEFDATYQKNISILDSQVIVTGNLDISTEL-NH2

308 Ac-ITLRLSGEFDATYQKNISILDSQVIVTGNLDISTE-NH2

309 Ac-GITLRLSGEFDATYQKNISILDSQVIVTGNLDIST-NH2

310 Ac-TATIEAVHEVTDGLSQLAVAVGKMQQFVNDQFNNT-NH2

311 Ac-ITATIEAVHEVTDGLSQLAVAVGKMQQFVNDQFNN-NH2

312 Ac-SITATIEAVHEVTDGLSQLAVAVGKMQQFVNDQFN-NH2

314 Ac-KESITATIEAVHEVTDGLSQLAVAVGKMQQFVNDQ-NH2

315 Ac-LKESITATIEAVHEVTDGLSQLAVAVGKMQQFVND-NH2

316 Ac-RLKESITATIEAVHEVTDGLSQLAVAVGKMQQFVN-NH2

317 Ac-LRLKESITATIEAVHEVTDGLSQLAVAVGKMQQFV-NH2

318 Ac-ILRLKESITATIEAVHEVTDGLSQLAVAVGKMQQF-NH2

319 Ac-NILRLKESITATIEAVHEVTDGLSQLAVAVGKMQQ-NH2

320 Ac-ANILRLKESITATIEAVHEVTDGLSQLAVAVGKMQ-NH2

321 Ac-AANILRLKESITATIEAVHEVTDGLSQLAVAVGKM-NH2

322 Ac-HKCDDECMNSVKNGTYDYPKYEEESKLNRNEIKGV-NH2

323 Ac-KCDDECMNSVKNGTYDYPKYEEESKLNRNEIKGVK-NH2

324 Ac-CDDECMNSVKNGTYDYPKYEEESKLNRNEIKGVKL-NH2

325 Ac-DDECMNSVKNGTYDYPKYEEESKLNRNEIKGVKLS-NH2

326 Ac-DECMNSVKNGTYDYPKYEEESKLNRNEIKGVKLSS-NH2

327 Ac-ECMNSVKNGTYDYPKYEEESKLNRNEIKGVKLSSM-NH2

328 Ac-CMNSVKNGTYDYPKYEEESKLNRNEIKGVKLSSMG-NH2

329 Ac-MNSVKNGTYDYPKYEEESKLNRNEIKGVKLSSMGV-NH2

330 Ac-NSVKNGTYDYPKYEEESKLNRNEIKGVKLSSMGVY-NH2

331 Ac-SVKNGTYDYPKYEEESKLNRNEIKGVKLSSMGVYQ-NH2

332 Ac-VKNGTYDYPKYEEESKLNRNEIKGVKLSSMGVYQI-NH2

333 Ac-KNGTYDYPKYEEESKLNRNEIKGVKLSSMGVYQIL-NH2

334 Ac-AFIRKSDELLHNV-NH2

335 Ac-VVLAGAALGVATAAQITAGIALHQSMLNSQAIDNL-NH2

336 Ac-VLAGAALGVATAAQITAGIALHQSMLNSQAIDNLR-NH2

337 Ac-LAGAALGVATAAQITAGIALHQSMLNSQAIDNLRA-NH2

338 Ac-AGAALGVATAAQITAGIALHQSMLNSQAIDNLRAS-NH2

339 Ac-GAALGVATAAQITAGIALHQSMLNSQAIDNLRASL-NH2

340 Ac-AALGVATAAQITAGIALHQSMLNSQAIDNLRASLE-NH2

341 Ac-ALGVATAAQITAGIALHQSMLNSQAIDNLRASLET-NH2

342 Ac-LGVATAAQITAGIALHQSMLNSQAIDNLRASLETT-NH2

343 Ac-GVATAAQITAGIALHQSMLNSQAIDNLRASLETTN-NH2

344 Ac-VATAAQITAGIALHQSMLNSQAIDNLRASLETTNQ-NH2

345 Ac-ATAAQITAGIALHQSMLNSQAIDNLRASLETTNQA-NH2

346 Ac-TAAQITAGIALHQSMLNSQAIDNLRASLETTNQAI-NH2

347 Ac-AAQITAGIALHQSMLNSQAIDNLRASLETTNQAIE-NH2

348 Ac-AQITAGIALHQSMLNSQAIDNLRASLETTNQAIEA-NH2

349 Ac-QITAGIALHQSMLNSQAIDNLRASLETTNQAIEAI-NH2

350 Ac-ITAGIALHQSMLNSQAIDNLRASLETTNQAIEAIR-NH2

351 Ac-TAGIALHQSMLNSQAIDNLRASLETTNQAIEAIRQ-NH2

352 Ac-AGIALHQSMLNSQAIDNLRASLETTNQAIEAIRQA-NH2

353 Ac-GIALHQSMLNSQAIDNLRASLETTNQAIEAIRQAG-NH2

354 Ac-IALHQSMLNSQAIDNLRASLETTNQAIEAIRQAGQ-NH2

355 Ac-ALHQSMLNSQAIDNLRASLETTNQAIEAIRQAGQE-NH2

356 Ac-LHQSMLNSQAIDNLRASLETTNQAIEAIRQAGQEM-NH2

357 Ac-HQSMLNSQAIDNLRASLETTNQAIEAIRQAGQEMI-NH2

358 Ac-QSMLNSQAIDNLRASLETTNQAIEAIRQAGQEMIL-NH2

359 Ac-SMLNSQAIDNLRASLETTNQAIEAIRQAGQEMILA-NH2

360 Ac-MLNSQAIDNLRASLETTNQAIEAIRQAGQEMILAV-NH2

361 Ac-LNSQAIDNLRASLETTNQAIEAIRQAGQEMILAVQ-NH2

362 Ac-NSQAIDNLRASLETTNQAIEAIRQAGQEMILAVQG-NH2

363 Ac-SQAIDNLRASLETTNQAIEAIRQAGQEMILAVQGV-NH2

364 Ac-QAIDNLRASLETTNQAIEAIRQAGQEMILAVQGVQ-NH2

365 Ac-AIDNLRASLETTNQAIEAIRQAGQEMILAVQGVQD-NH2

366 Ac-IDNLRASLETTNQAIEAIRQAGQEMILAVQGVQDY-NH2

367 Ac-DNLRASLETTNQAIEAIRQAGQEMILAVQGVQDYI-NH2

368 Ac-NLRASLETTNQAIEAIRQAGQEMILAVQGVQDYIN-NH2

369 Ac-LRASLETTNQAIEAIRQAGQEMILAVQGVQDYINN-NH2

370 Ac-RASLETTNQAIEAIRQAGQEMILAVQGVQDYINNE-NH2

371 Ac-YTSVITIELSNIKENKUNGTDAVKLIKQELDKYK-NH2

372 Ac-TSVITIELSNIKENKUNGTDAVKLIKQELDKYKN-NH2

373 Ac-SVITIELSNIKENKUNGTDAVKLIKQELDKYKNA-NH2

374 Ac-SNIKENKUNGTDAKVKLIKQELDKYKNAVTELQLL-NH2

375 Ac-KENKDNGTDAKVKLIKQELDKYKNAVTELQLLMQS-NH2

376 Ac-CLELDKWASLWNWFC-NH2

377 Ac-CLELDKWASLANWFC-NH2

378 Ac-CLELDKWASLFNFFC-NH2

379 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLFNFF-NH2

381 Ac-RMKQLEDKVEELLSKNYHLENELELDKWASLWNWF-NH2

382 Ac-KVEELLSKNYHLENELELDKWASLWNWF-NH2

383 Ac-RMKQLEDKVEELLSKLEWIRRSNQKLDSI-NH2

384 Ac-RMKQLEDKVEELLSKLAFIRKSDELLHNV-NH2

385 Ac-ELEALRGELRALRGELELDKWASLWNWF-NH2

386 Ac-LDPIDISIELNKAKSDLEESKEWIRRSNQKLDSI-NH2

387 Ac-CNEQLSDSFPVEFFQV-NH2

388 Ac-MAEDDPYLGRPEQMFHLDPSL-NH2

389 Ac-EDFSSIADMDFSALLSQISS-NH2

390 Ac-TWQEWERKVDFLEENITALLEEAQIQQEKNMYELQ-NH2

391 Ac-WQEWERKVDFLEENITALLEEAQIQQEKNMYELQK-NH2

392 Ac-QEWERKVDFLEENITALLEEAQIQQEKNMYELQKL-NH2

393 Ac-EWERKVDFLEENITALLEEAQIQQEKNMYELQKLN-NH2

394 Ac-WERKVDFLEENITALLEEAQIQQEKNMYELQKLNS-NH2

395 Ac-ERKVDFLEENITALLEEAQIQQEKNMYELQKLNSW-NH2

396 Ac-RKVDFLEENITALLEEAQIQQEKNMYELQKLNSWD-NH2

397 Ac-KVDFLEENITALLEEAQIQQEKNMYELQKLNSWDV-NH2

398 Ac-VDFLEENITALLEEAQIQQEKNMYELQKLNSWDVF-NH2

399 Ac-DFLEENITALLEEAQIQQEKNMYELQKLNSWDVFG-NH2

400 Ac-FLEENITALLEEAQIQQEKNMYELQKLNSWDVFGN-NH2

401 Ac-LEENITALLEEAQIQQEKNMYELQKLNSWDVFGNW-NH2

402 Ac-LEENITALLEEAQIQQEKNMYELQKLNSWDVFGNWF-NH2

403 Ac-NEQSEEKENELYWAKEQLLDLLFNIFNQTVGAWIMQ-NH2

405 Ac-QQQLLDVVKRQQELLRLTVWGTKNLQTRVTAIEKYLKD-NH2

406 Ac-QQLLDVVKRQQELLRLTVWGTKNLQTRVTAIEKYLKDQ-NH2

407 Ac-QQLLDVVKRQQELLRLTVWGPKNLQTRVTAIEKYLKDQ-NH2

408 Ac-DERKQDKVLWQQTGTLQLTLIQLEKTAKLQWVRLNRY-NH2

409 Ac-QQQLLDVVKRQQELLRLTVWGTKNLQTRVTAIEKY-NH2

410 Ac-QQLLDVVKRQQELLRLTVWGTKNLQTRVTAIEKYL-NH2

411 Ac-QLLDVVKRQQELLRLTVWGTKNLQTRVTAIEKYLK-NH2

412 Ac-LLDVVKRQQELLRLTVWGTKNLQTRVTAIEKYLKD-NH2

413 Ac-LDVVKRQQELLRLTVWGTKNLQTRVTAIEKYLKDQ-NH2

414 Ac-DVVKRQQELLRLTVWGTKNLQTRVTAIEKYLKDQA-NH2

415 Ac-VVKRQQELLRLTVWGTKNLQTRVTAIEKYLKDQAQ-NH2

416 Ac-VKRQQELLRLTVWGTKNLQTRVTAIEKYLKDQAQL-NH2

417 Ac-KRQQELLRLTVWGTKNLQTRVTAIEKYLKDQAQLN-NH2

418 Ac-RQQELLRLTVWGTKNLQTRVTAIEKYLKDQAQLNA-NH2

419 Ac-QQELLRLTVWGTKNLQTRVTAIEKYLKDQAQLNAW-NH2

420 Ac-QELLRLTVWGTKNLQTRVTAIEKYLKDQAQLNAWG-NH2

421 Ac-ELLRLTVWGTKNLQTRVTAIEKYLKDQAQLNAWGC-NH2

422 Ac-NNLLRAIEAQQHLLQLTVWGPKQLQARILAVERYLKDQ-NH2

423 Ac-SELEIKRYKNRVASRKCRAKFKQLLQHYREVAAAK-NH2

424 Ac-ELEIKRYKNRVASRKCRAKFKQLLQHYREVAAAKS-NH2

425 Ac-LEIKRYKNRVASRKCRAKFKQLLQHYREVAAAKSS-NH2

426 Ac-EIKRYKNRVASRKCRAKFKQLLQHYREVAAAKSSE-NH2

427 Ac-IKRYKNRVASRKCRAKFKQLLQHYREVAAAKSSEN-NH2

428 Ac-KRYKNRVASRKCRAKFKQLLQHYREVAAAKSSEND-NH2

429 Ac-RYKNRVASRKCRAKFKQLLQHYREVAAAKSSENDR-NH2

430 Ac-YKNRVASRKCRAKFKQLLQHYREVAAAKSSENDRL-NH2

431 Ac-KNRVASRKCRAKFKQLLQHYREVAAAKSSENDRLR-NH2

432 Ac-NRVASRKCRAKFKQLLQHYREVAAAKSSENDRLRL-NH2

433 Ac-RVASRKCRAKFKQLLQHYREVAAAKSSENDRLRLL-NH2

434 Ac-VASRKCRAKFKQLLQHYREVAAAKSSENDRLRLLL-NH2

435 Ac-ASRKCRAKFKQLLQHYREVAAAKSSENDRLRLLLK-NH2

436 Ac-SRKCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQ-NH2

437 Ac-RKCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQM-NH2

438 Ac-KCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMC-NH2

439 Ac-CRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCP-NH2

440 Ac-RAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPS-NH2

441 Ac-AKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSL-NH2

442 Ac-KFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLD-NH2

443 Ac-FKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDV-NH2

444 Ac-KQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVD-NH2

445 Ac-QLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDS-NH2

446 Ac-LLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSI-NH2

447 Ac-LQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSII-NH2

448 Ac-QHYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIP-NH2

449 Ac-HYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPR-NH2

450 Ac-YREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRT-NH2

451 Ac-REVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTP-NH2

452 Ac-EVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPD-NH2

453 Ac-VAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPDV-NH2

454 Ac-AAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPDVL-NH2

455 Ac-AAKSSENDRLRLLLKQMCPSLDVDSIIPRTPDVLH-NH2

456 Ac-AKSSENDRLRLLLKQMCPSLDVDSIIPRTPDVLHE-NH2

457 Ac-KSSENDRLRLLLKQMCPSLDVDSIIPRTPDVLHED-NH2

458 Ac-SSENDRLRLLLKQMCPSLDVDSIIPRTPDVLHEDL-NH2

459 Ac-SENDRLRLLLKQMCPSLDVDSIIPRTPDVLHEDLL-NH2

460 Ac-ENDRLRLLLKQMCPSLDVDSIIPRTPDVLHEDLLN-NH2

461 Ac-NDRLRLLLKQMCPSLDVDSIIPRTPDVLHEDLLNF-NH2

534 Ac-PGYRWMCLRRFIIFLFILLLCLIFLLVLLDYQGML-NH2

535 Ac-GYRWMCLRRFIIFLFILLLCLIFLLVLLDYQGMLP-NH2

536 Ac-YRWMCLRRFIIFLFILLLCLIFLLVLLDYQGMLPV-NH2

537 Ac-RWMCLRRFIIFLFILLLCLIFLLVLLDYQGMLPVC-NH2

538 Ac-WMCLRRFIIFLFILLLCLIFLLVLLDYQGMLPVCP-NH2

539 Ac-MCLRRFIIFLFILLLCLIFLLVLLDYQGMLPVCPL-NH2

540 Ac-CLRRFIIFLFILLLCLIFLLVLLDYQGMLPVCPLI-NH2

541 Ac-LRRFIIFLFILLLCLIFLLVLLDYQGMLPVCPLIP-NH2

542 Ac-RRFIIFLFILLLCLIFLLVLLDYQGMLPVCPLIPG-NH2

543 Ac-RFIIFLFILLLCLIFLLVLLDYQGMLPVCPLIPGS-NH2

544 Ac-FIIFLFILLLCLIFLLVLLDYQGMLPVCPLIPGSS-NH2

545 Ac-IIFLFILLLCLIFLLVLLDYQGMLPVCPLIPGSST-NH2

546 Ac-IFLFILLLCLIFLLVLLDYQGMLPVCPLIPGSSTT-NH2

547 Ac-FLFILLLCLIFLLVLLDYQGMLPVCPLIPGSSTTS-NH2

548 Ac-LFILLLCLIFLLVLLDYQGMLPVCPLIPGSSTTST-NH2

549 Ac-FILLLCLIFLLVLLDYQGMLPVCPLIPGSSTTSTG-NH2

550 Ac-ILLLCLIFLLVLLDYQGMLPVCPLIPGSSTTSTGP-NH2

551 Ac-LLLCLIFLLVLLDYQGMLPVCPLIPGSSTTSTGPC-NH2

552 Ac-LLCLIFLLVLLDYQGMLPVCPLIPGSSTTSTGPCR-NH2

553 Ac-LCLIFLLVLLDYQGMLPVCPLIPGSSTTSTGPCRT-NH2

554 Ac-CLIFLLVLLDYQGMLPVCPLIPGSSTTSTGPCRTC-NH2

555 Ac-LIFLLVLLDYQGMLPVCPLIPGSSTTSTGPCRTCM-NH2

556 Ac-IFLLVLLDYQGMLPVCPLIPGSSTTSTGPCRTCMT-NH2

557 Ac-FLLVLLDYQGMLPVCPLIPGSSTTSTGPCRTCMTT-NH2

558 Ac-PLLVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGT-NH2

559 Ac-LLVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGTT-NH2

560 Ac-LVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTV-NH2

561 Ac-VLQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVC-NH2

562 Ac-LQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCL-NH2

563 Ac-QAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLG-NH2

564 Ac-AGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQ-NH2

565 Ac-GFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQN-NH2

566 Ac-FFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNS-NH2

567 Ac-FLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQ-NH2

568 Ac-LLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQS-NH2

569 Ac-LTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQSP-NH2

570 Ac-FWNWLSAWKDLELKSLLEEVKDELQKMR-NH2

571 Ac-NNLLRAIEAQQHLLQLTVW-NH2

572 Ac-CGGNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2

573 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

574 C13H27CO-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

575 Ac-AVSKGYLSALRTGWYTSVITIELSNIKENKUNGTDA-NH2

576 Ac-SISNIETVIEFQQKNNRLLEITREFSVNAGVTTPVS-NH2

577 Ac-DQQIKQYKRLLDRLIIPLYDGLRQKDVIVSNQESN-NH2

578 Ac-YSELTNIFGDNIGSLQEKGIKLQGIASLYRTNITEI-NH2

579 Ac-TSITLQVRLPLLTRLLNTQIYRVDSISYNIQNREWY-NH2

580 Ac-VEIAEYRRLLRTVLEPIRDALNAMTQMIRPVQSVA-NH2

581 Ac-SYFIVLSIAYPTLSEIKGVIVHRLEGVSYNIGSQEW-NH2

582 Ac-LKEAIRDTNKAVQSVQSSIGNLIVAIKS-NH2

583 NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2

584 QKQEPIDKELYPLTSL

585 YPKFVKQNTLKLAT

586 QYIKANQKFIGITE

587 NGQIGNDPNRDILY

588 Ac-RPDVY-OH

589 CLELDKWASLWNWFC- (cyclic)

590 CLELDKWASLANWFC- (cyclic)

591 CLELDKWASLANFFC- (cyclic)

594 Ac-NNLLRAIEAQQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2

595 Ac-CGGVTSLIHSLIEESQNQQEKNEQELLELDKWASLWNNWF-NH2

596 Ac-PLLVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGT-NH2

597 Ac-LLVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGTT-NH2

598 Ac-LVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTV-NH2

599 Ac-VLQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVC-NH2

600 Ac-LQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCL-NH2

601 Ac-QAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLG-NH2

602 Ac-AGFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQ-NH2

603 Ac-GFFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQN-NH2

604 Ac-FFLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNS-NH2

605 Ac-FLLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQ-NH2

606 Ac-LLTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQS-NH2

607 Ac-LTRILTIPQSLDSWWTSLNFLGGTTVCLGQNSQSP-NH2

608 Ac-LELDKWASLWNWA-NH2

609 Ac-LELDKWASAWNWF-NH2

610 Ac-LELDKAASLWNWF-NH2

611 Ac-LKLDKWASLWNWF-NH2

612 Ac-LELKKWASLWNWF-NH2

613 Ac-DELLHNVNAGKST-NH2

614 Ac-KSDELLHNVNAGKST-NH2

615 Ac-IRKSDELLHNVNAGKST-NH2

616 Ac-AFIRKSDELLHNVNAGKST-NH2

617 Ac-FDASISQVNEKINQSLAFI-NH2

618 Ac-YAADKESTQKAFDGITNKVNSVIEKMNTQFEAVGKE-NH2

619 Ac-SVIEKMNTQFEAVGKEFGNLERRLENLNKRMEDGFL-NH2

620 Ac-VWTYNAELLVLMENERTLDFHDSNVKNLYDKVRMQL-NH2

621 Ac-EWDREINNYTSLIHSLIEESQNQQEKNEQEGGC-NH2

622 Ac-INNYTSLIHSLIEESQNQQEKKEQELLELDKWASL-NH2

623 Ac-INNYTSLIHSLIEESQNQQEKNEQELLE-NH2

624 Ac-WMEWDREINNYTSLIHSLIEESQNQQEKNEQELLE-NH2

625 Ac-MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

626 Ac-IDISIELNKAKSDLEESKEWIKKSNQKLDSIGNWH-NH2

627 Ac-NQQEKNEQELLELDKWASLWNWFNITNWLWYIKIFI-NH2

628 Ac-QNQQEKNEQELLELDKWASLWNWFNITNWLWYIKIF-NH2

629 Ac-SQNQQEKNEQELLELDKWASLWNWFNITNWLWYIKI-NH2

630 Ac-ESQNQQEKNEQELLELDKWASLWNWFNITNWLWYIK-NH2

631 Ac-EESQNQQEKNEQELLELDKWASLWNWFNITNWLWYI-NH2

632 Ac-IEESQNQQEKNEQELLELDKWASLWNWFNITNWLWY-NH2

633 Ac-LIEESQNQQEKNEQELLELDKWASLWNWFNITNWLW-NH2

634 Ac-SLIEESQNQQEKNEQELLELDKWASLWNWFNITNWL-NH2

635 Ac-HSLIEESQNQQEKNEQELLELDKWASLWNWFNITNW-NH2

636 Ac-IHSLIEESQNQQEKNEQELLELDKWASLWNWFNITN-NH2

637 Ac-LIHSLIEESQNQQEKNEQELLELDKWASLWNWFNIT-NH2

638 Ac-SLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNI-NH2

639 Ac-TSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFN-NH2

640 Ac-NYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNW-NH2

641 Ac-NNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWN-NH2

642 Ac-INNYTSLIHSLIEESQNQQEKNEQELLELDKWASLW-NH2

643 Ac-EINNYTSLIHSLIEESQNQQEKNEQELLELDKWASL-NH2

644 Ac-REINNYTSLIHSLIEESQNQQEKNEQELLELDKWAS-NH2

645 Ac-DREINNYTSLIHSLIEESQNQQEKNEQELLELDKWA-NH2

646 Ac-WDREINNYTSLIHSLIEESQNQQEKNEQELLELDKW-NH2

647 Ac-EWDREINNYTSLIHSLIEESQNQQEKNEQELLELDK-NH2

648 Ac-MEWDREINNYTSLIHSLIEESQNQQEKNEQELLELD-NH2

649 Ac-WMEWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2

650 Ac-TWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLE-NH2

651 Ac-MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELL-NH2

652 Ac-NMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQEL-NH2

653 Ac-NNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQE-NH2

654 Ac-WNNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQ-NH2

655 Ac-IWNNMTWMEWDREINNYTSLIHSLIEESQNQQEKNE-NH2

656 Ac-QIWNNMTWMEWDREINNYTSLIHSLIEESQNQQEKN-NH2

657 Ac-EQIWNNMTWMEWDREINNYTSLIHSLIEESQNQQEK-NH2

658 Ac-LEQIWNNMTWMEWDREINNYTSLIHSLIEESQNQQE-NH2

659 Ac-SLEQIWNNMTWMEWDREINNYTSLIHSLIEESQNQQ-NH2

660 Ac-KSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQNQ-NH2

661 Ac-NKSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQN-NH2

662 Ac-SLAFIRKSDELLHNVNAGKST-NH2

663 Ac-FDASISQVNEKINQSLAFIRK-NH2

664 Ac-YTSLIHSLIEESQQQQEKQEQELLELDKWASLWNWF-NH2

665 Ac-FDASISQVNEKINQSLAFIRKSDELLHNVNAGK-NH2

666 Ac-FDASISQVNEKINQSLAFIRKSDELLHNVNA-NH2

667 Ac-FDASISQVNEKINQSLAFIRKSDELLHNV-NH2

668 Ac-FDASISQVNEKINQSLAFIRKSDELLH-NH2

669 Ac-FDASISQVNEKINQSLAFIRKSDEL-NH2

670 Ac-FDASISQVNEKINQSLAFIRKSD-NH2

671 Ac-ASISQVNEKINQSLAFIRKSDELLHNVNAGKST-NH2

672 Ac-ISQVNEKINQSLAFIRKSDELLHNVNAGKST-NH2

673 Ac-QVNEKINQSLAFIRKSDELLHNVNAGKST-NH2

674 Ac-NEKINQSLAFIRKSDELLHNVNAGKST-NH2

675 Ac-KINQSLAFIRKSDELLHNVNAGKST-NH2

676 Ac-NQSLAFIRKSDELLHNVNAGKST-NH2

677 Ac-FWNWLSAWKDLELYPGSLELDKWASLWNWF-NH2

678 Ac-CGGNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2

679 Ac-CGGYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

680 YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF

681 NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ

682 Ac-EKNMYELQKLNSWDVFTNWLDFTSWVRYIQYIQYGV-NH2

683 Ac-QEKNMYELQKLNSWDVFTNWLDFTSWVRYIQYIQYG-NH2

684 Ac-QQEKNMYELQKLNSWDVFTNWLDFTSWVRYIQYIQY-NH2

685 Ac-IQQEKNMYELQKLNSWDVFTNWLDFTSWVRYIQYIQ-NH2

686 Ac-QIQQEKNMYELQKLNSWDVFTNWLDFTSWVRYIQYI-NH2

687 Ac-AQIQQEKNMYELQKLNSWDVFTNWLDFTSWVRYIQY-NH2

688 Ac-QAQIQQEKNMYELQKLNSWDVFTNWLDFTSWVRYIQ-NH2

689 Ac-EQAQIQQEKNMYELQKLNSWDVFTNWLDFTSWVRYI-NH2

690 Ac-LEQAQIQQEKNMYELQKLNSWDVFTNWLDFTSWVRY-NH2

691 Ac-SLEQAQIQQEKNMYELQKLNSWDVFTNWLDFTSWVR-NH2

692 Ac-QSLEQAQIQQEKNMYELQKLNSWDVFTNWLDFTSWV-NH2

693 Ac-SQSLEQAQIQQEKNMYELQKLNSWDVFTNWLDFTSW-NH2

694 Ac-ISQSLEQAQIQQEKNMYELQKLNSWDVFTNWLDFTS-NH2

695 Ac-NISQSLEQAQIQQEKNMYELQKLNSWDVFTNWLDFT-NH2

696 Ac-ANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWLDF-NH2

697 Ac-EANISQSLEQAQIQQEKNMYELQKLNSWDVFTNWLD-NH2

699 Ac-YLEANISQSLEQAQIQQEKNMYELQKLNSWDVFTNW-NH2

700 Ac-YTSLIHSLIEESQNQQEKNEQEL-NH2

701 Ac-YTSLIHSLIEESQNLQEKNEQELLELDKWASLWNWF-NH2

702 Ac-YTSLIHSLIEESQNQQEKLEQELLELDKWASLWNWF-NH2

703 Ac-YTSLIHSLIEESQNQQEKNEQELLEFDKWASLWNWF-NH2

704 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKPASLWNWF-NH2

705 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASPWNWF-NH2

706 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNSF-NH2

707 Biotin NH(CH2)4CO-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

708 Biotin NH(CH2)6CO-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

709 FMOC-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF

710 FMOC-NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ

711 Ac-EWDREINNYTSLIHSLIEESQNQQEKNEQE-NH2

712 Ac-LIEESQNQQEKNEQELLELDKWASLWNWF-NH2

713 Ac-FWNWLSAWKDLELGGPGSGPGGLELDKWASLWNWF-NH2

714 Ac-LIHSLIEESQNQQEKNEQELLELDKWASL-NH2

715 Ac-TSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

716 Ac-LIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

718 FMOC-GGGGGYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

719 Ac-HSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

720 Ac-YTSLIYSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

721 Ac-YTSLIHSLIEKSQNQQEKNEQELLELDKWASLWNWF-NH2

722 Ac-YTSLIHSSIEESQNQQEKNEQELLELDKWASLWNWF-NH2

723 Ac-LEANISQLLEQAQIQQEKNMYELQKLNSWDVFTNWL-NH2

724 Ac-SLEECDSELEIKRYKNRVASRKCRAKFKQLLQHYR-NH2

725 Ac-LEECDSELEIKRYKNRVASRKCRAKFKQLLQHYRE-NH2

726 Ac-EECDSELEIKRYKNRVASRKCRAKFKQLLQHYREV-NH2

727 Ac-ECDSELEIKRYKNRVASRKCRAKFKQLLQHYREVA-NH2

728 Ac-CDSELEIKRYKNRVASRKCRAKFKQLLQHYREVAA-NH2

729 Ac-DSELEIKRYKNRVASRKCRAKFKQLLQHYREVAAA-NH2

730 Desaminotyrosine-FDASISQVNEKINQSLAFIRKSDELLHNVNAGKST-NH2

731 WASLWNW-NH2

732 Ac-EAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLGIWG-NH2

733 Ac-IEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLGIW-NH2

734 Ac-AIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLGI-NH2

735 Ac-RAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLG-NH2

736 Ac-LRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLL-NH2

737 Ac-LLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQL-NH2

738 Ac-NLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQ-NH2

739 Ac-QNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKD-NH2

740 Ac-QQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLK-NH2

741 Ac-QQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYL-NH2

742 Ac-VQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERY-NH2

743 Ac-IVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVER-NH2

744 Ac-GIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVE-NH2

745 Ac-SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAV-NH2

758 Ac-RSMTLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQLTV-NH2

760 Ac-GARSMTLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQL-NH2

764 Ac-GSTMGARSMTLTVQARQLLSGIVQQQNNLLRAIEAQQH-NH2

765 Ac-GSTMGARSHTLTVQARQLLSGIVQQQNNLLRAIEAQQH-NH2

766 Ac-EGSTMGARSMTLTVQARQLLSGIVQQQNNLLRAIEAQQ-NH2

767 Ac-RAKFKQLLQHYREVAAAKSSENDRLRLL-NH2

768 Ac-AKFKQLLQHYREVAAAKSSENDRLRLLL-NH2

769 Ac-KFKQLLQHYREVAAAKSSENDRLRLLLK-NH2

770 Ac-FKQLLQHYREVAAAKSSENDRLRLLLKQ-NH2

771 Ac-RAKFKQELQHYREVAAAKSSENDRLRLLLKQMCPS-NH2

772 DKWASLWNWF-NH2

773 Biotin-FDASISQVNEKINQSLAFIRKSDELLHNVNAGKST-NH2

774 Ac-YDASISQVNEKINQSLAFIRKSDELLHNVNAGKST-NH2

775 Ac-YDASISQVNEKINQSLAYIRKSDELLHNVNAGKST-NH2

776 Ac-FDASISQVNEKINQSLAYIRKSDELLHNVNAGKST-NH2

777 Ac-FDASISQVQEKIQQSLAFIRKSDELLHQVQAGKST-NH2

778 Ac-FDASISQVNEKINQALAFIRKADELLHNVNAGKST-NH2

779 Ac-FDASISQVNEKINQALAFIRKSDELLHNVNAGKST-NH2

780 Ac-FDASISQVNEKINQSLAFIRKADELLHNVNAGKST-NH2

781 Ac-YDASISQVQEEIQQALAFIRKADELLEQVQAGKST-NH2

782 Ac-FDASISQVNEKINQSLAFIRKSDELLENVNAGKST-NH2

783 Ac-FDASISQVNEEINQSLAFIRKSDELLHNVNAGKST-NH2

784 Ac-VFPSDEFDASISQVNEKINQSLAFIRKSDELLENV-NH2

785 Ac-VFPSDEFDASISQVNEEINQSLAFIRKSDELLENV-NH2

786 Ac-VYPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2

787 Ac-VFPSDEFDASISQVNEEINQSLAFIRKSDELLHNV-NH2

788 Ac-SNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQ-NH2

789 Ac-WSNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEES-NH2

790 Ac-SWSNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEE-NH2

791 Ac-ASWSNKSLEQIWNNMTWMEWDREINNYTSLIHSLIE-NH2

792 Ac-NASWSNKSLEQIWNNMTWMEWDREINNYTSLIHSLI-NH2

793 Ac-WNASWSNKSLEQIWNNMTWMEWDREINNYTSLIHSL-NH2

794 Ac-PWNASWSNKSLEQIWNNMTWMEWDREINNYTSLIHS-NH2

795 Ac-VPWNASWSNKSLEQIWNNMTWMEWDREINNYTSLIH-NH2

796 Ac-AVPWNASWSNKSLEQIWNNMTWMEWDREINNYTSLI-NH2

797 Ac-TAVPWNASWSNKSLEQIWNNMTWMEWDREINNYTSL-NH2

798 Ac-TTAVPWNASWSNKSLEQIWNNMTWMEWDREINNYTS-NH2

800 Ac-AAASDEFDASISQVNEKINQSLAFIRKSDELLHNV-NH2

801 Ac-VFPAAAFDASISQVNEKINQSLAFIRKSDELLHNV-NH2

802 Ac-VFPSDEAAASISQVNEKINQSLAFIRKSDELLHNV-NH2

803 Ac-VFPSDEFDAAAAQVNEKINQSLAFIRKSDELLHNV-NH2

804 Ac-VFPSDEFDASISAAAEKINQSLAFIRKSDELLHNV-NH2

805 Ac-VFPSDEFDASISQVNAAANQSLAFIRKSDELLHNV-NH2

806 Ac-VFPSDEFDASISQVNEKIAAALAFIRKSDELLHNV-NH2

807 Ac-VFPSDEFDASISQVNEKINQSAAAIRKSDELLHNV-NH2

808 Ac-VFPSDEFDASISQVNEKINQSLAFAAASDELLHNV-NH2

809 Ac-VFPSDEFDASISQVNEKINQSLAFIRKAAALLHNV-NH2

810 Ac-VFPSDEFDASISQVNEKINQSLAFIRKSDEAAANV-NH2

811 Ac-VFPSDEFDASISQVNEKINQSLAFIRKSDELLAAA-NH2

812 Ac-WPSDEFDASISQVNEKINQSLAFIRKSDELLHNV-NH2

813 Ac-AAAAIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

814 Ac-YTSLIHSLIEESQQQQEKNEQELLELDKWASLWNWF-NH2

815 Ac-YTSLIHSLIEESQNQQEKQEQELLELDKWASLWNWF-NH2

816 Ac-QIWNNMTWMEWDREINNYTSLIHSLIEESQNQQEKQ-NH2

817 Ac-QIWNNMTWMEWDREINNYTSLIHSLIEESQQQQEKN-NH2

818 Ac-QIWNNMTWMEWDREINNYTSLIHSLIEESQQQQEKQ-NH2

819 Ac-NKSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQQ-NH2

820 Ac-FDASISQVNEKINQSLAFIEESDELLHNVNAGKST-NH2

821 Ac-ACIRKSDELCL-NH2

823 Ac-YTSLIHSLIEESQNQQEKDEQELLELDKWASLWNWF-NH2

824 Ac-YTSLIHSLIEESQDQQEKNEQELLELDKWASLWNWF-NH2

825 Ac-YTSLIHSLIEESQDQQEKDEQELLELDKWASLWNWF-NH2

826 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWDWF-NH2

841 Ac-LEANITQSLEQAQIQQEKNMYELQKLNSWDVFTNWL-NH2

842 Ac-LEAKISASLEQAQIQQEKNMYELQKLNSWDVFTNWL-NH2

843 Ac-LEANISALLEQAQIQQEKNMYELQKLNSWDVFTNWL-NH2

844 Ac-LEANITALLEQAQIQQEKNMYELQKLNSWDVFTNWL-NH2

845 Ac-LEANITASLEQAQIQQEKNMYELQKLNSWDVFTNWL-NH2
q a c Ac- LEAN ITAS LEQAQI QQEKNMYELQKLNS WD VFTNWL -NH2 

Ac-RAKFKQLLQHYREVAAAKSSENDRLRLLLKQMUPS-NH2

Ac-Abu-DDE-Abu-MNSVKNGTYDYPKYEEESKLNRNEIKGVKL-NH2

Ac-WQEWEQKVRYLEANISQSLEQAQIQQEKNMYELQKL-NH2

860 Ac-DEYDASISQVNEKINQSIAFIRKSDELLHNVNAGK-NH2

861 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWN-NH2

862 Ac-YTSLIHSLIEESQNQQEKKEQELLELDKWASLW-NH2

863 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASL-NH2

864 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWAS-NH2

Ac-QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-NH2

Ac-DREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

Ac-NNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDK-NH2

868 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWAAA-NH2

869 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWAAAANWF-NH2

870 Ac-YTSLIHSLIEESQNQQEKNEQELLELDAAASLWNWF-NH2

871 Ac-YTSLIHSLIEESQNQQEKNEQELLAAAKWASLWNWF-NH2

872 Ac-YTSLIHSLIEESQNQQEKNEQAAAELDKWASLWNWF-NH2

873 Ac-YTSLIHSLIEESQNQQEKAAAELLELDKWASLWNWF-NH2

874 Ac-YTSLIHSLIEESQNQAAANEQELLELDKWASLWNWF-NH2

875 Ac-YTSLIHSLIEESAAAQEKNEQELLELDKWASLWNWF-NH2

876 Ac-YTSLIHSLIAAAQNQQEKNEQELLELDKWASLWNWF-NH2

877 Ac-YTSLIHAAAEESQNQQEKNEQELLELDKWASLWNWF-NH2

878 Ac-YTSAAASLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

879 Ac-EIWNNMTWMEWDRENEKINQSLAFIRKSDELLHNV-NH2

880 Ac-YISEVNEEINQSLAFIRKADELLENVDKWASLWNWF-NH2

881 Ac-TSVITIELSNIKENKANGTDAKVKLIKQELDKYKN-NH2

882 YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFMG-NH2

883 Ac-NEKINQSLAFIRKSDELLHNV-NH2

884 Biotin-YDPLVFPSDEFDASISQVNEKINQSLAFIRKSDEL-NH2

885 Biotin-PLVFPSDEFDASISQVNEKINQSLAFIRKSDELLH-NH2

886 Biotin-VFPSDEFDASISQVNEKINQSLAFIRKSDELLHNV-NH2

887 Biotin-DEFDASISQVNEKINQSLAFIRKSDELLHNVNAGK-NH2

888 Biotin-VYPSDEFDASISQVNEKINQSLAFIRKSDELLHNV-NH2

889 Biotin-VYPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2

890 Ac-VYPSDEFDASISQVQEEIQQALAFIRKADELLEQV-NH2

891 Ac-NYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

892 Ac-NNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

893 Ac-INNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

894 Ac-EINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

895 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFN-NH2

896 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNI-NH2

897 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNIT-NH2

898 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNITN-NH2

899 Ac-YDPLVFPSDEFDASISQVNEKINQSLAFIRKSDELLHNVNAGK-NH2

900 Ac-NYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFN-NH2

901 Ac-NNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNI-NH2

905 Ac-KCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPD-NH2

906 Ac-RAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPRTPD-NH2

907 Ac-VYPSDEYDASISQVNEEINQALAYIAAADELLENV-NH2

909 Ac-YDASISQVNEEINQALAYIRKADELL-NH2

910 Ac-M-Nle-WMEWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2

911 Ac-KNGTYDYPKYEEESKLNRNEIKGVKLSSMGVYQI-NH2

912 Ac-VTEKIQMASDNINDLIQSGVNTRLLTIQSHVQNYI-NH2

913 QNQQEKNEQELLELDKWASLWNWF-NH2

914 Ac-QNQQEKNEQELLELDKWASLWNWF-NH2

915 LWNWF-NH2

916 ELLELDKWASLWNWF-NH2

917 EKNEQELLELDKWASLWNWF-NH2

918 SLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

919 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNW

920 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWN

921 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLW

922 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASL

923 TSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

924 SLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

925 LIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

926 IHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

940 Ac-AAVALLPAVLLALLAPSELEIKRYKNRVASRKCRAKFKQLLQHYREVAAAK-NH2

941 Ac-AAVALLPAVLLALLAPCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCP-NH2

942 Ac-YTSLIHSLIEESQNQQEKNNNIERDWEMWTMNNWIQ-NH2

944 VYPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2

945 Ac-LMQLARQLMQLARQMKQLADSLMQLARQVSRLESA-NH2

946 Ac-WMEWDREINNYTSLIHSLIEESQNQQEKNEQELL-NH2

947 Ac-MEWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2

948 Ac-EWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2

949 Ac-MEWDREINNYTSLIHSLIEESQNQQEKNEQELLE-NH2

950 Biotin-W-Nle-EWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2

951 Ac-YLEYDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2

952 Ac-IKQFINMWQEVGKAMYA-NH2

953 Ac-IRKSDELL-NH2

954 Decanoyl-IRKSDELL-NH2

955 Acetyl-Aca-Aca-IRKSDELL-NH2

956 Ac-YDASISQV-NH2

957 Ac-NEKINQSL-NH2

958 Ac-SISQVNEEINQALAYIRKADELL-NH2

959 Ac-QVNEEINQALAYIRKADELL-NH2

960 Ac-EEINQALAYIRKADELL-NH2

961 Ac-NQALAYIRKADELL-NH2

962 Ac-LAYIRKADELL-NH2

963 PDASISQVNEKINQALAFIRKSDELL-NH2

964 Ac-W-Nle-EWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2

965 Ac-ASRKCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDS-NH2

967 Ac-WLEWDREINNYTSLIHSLIEESQNQQEKNEQELLEL-NH2

968 Ac-YVKGEPIINFYDPLVFPSDEFDASISQVNEKINQSL-NH2

969 Ac-VYPSDEYDASISQVNEEINQSLAYIRKADELLHNV-NH2

970 Ac-YDASISQVNEEINQALAYIRKADELLENV-NH2

971 Ac-YDASISQVNEEINQALAYIRKADELLE-NH2

972 Ac-VYPSDEYDASISQVNEEINQALAYIRKAAELLHNV-NH2

973 Ac-VYPSDEYDASISQVNEEINQALAYIRKALELLHNV-NH2

974 Decanoyl-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

975 Ac-VYPSDEYDASISQVNEEINQLLAYIRKLDELUSNV-NH2

976 Ac-DEYDASISQVNEKINQSLAFIRKSDELL-NH2

977 Ac-SNDQGSGYAADKESTQKAFDGITNKVNSVIEKTNT-NH2

978 Ac-ESTQKAFDGITNKVNSVIEKTNTQFEAVGKEFGNLEKR-NH2

979 Ac-DGITNKVNSVIEKTNTQFEAVGKEFGNLEKRLENLNK-NH2

980 Ac-DSNVKNLYDKVRSQLRDNVKELGNGAFEFYHK-NH2

981 Ac-RDNVKELGNGAFEFYHKADDEALNSVKNGTYDYPKY-NH2

982 Ac-EFYHKADDEALNSVKNGTYDYPKY-NH2

983 Ac-AAVALLPAVLLALLAPAADKESTQKAFDGITNKVNS-NH2

984 Ac-AAVALLPAVLLALLAPAADSNVKNLYDKVRSQLRDN-NH2

985 Ac-KESTQKAFDGITNKVNSV-NH2

986 Ac-IEKTNTQFEAVGKEFGNLER-NH2

987 Ac-RLENLNKRVEDGFLDVWTYNAELLVALENE-NH2

988 Ac-SNVKNLYDKVRSQLRDN-NH2

989 Ac-WMEWDREINNYTSLIHSLIEESQNQQEKNEQEL-NH2

990 Ac-WMEWDREINNYTSLIHSLIEESQNQQEKNEQE-NH2

991 Ac-MEWDREINNYTSLIHSLIEESQNQQEKNEQEL-NH2

992 Ac-MEWDREINNYTSLIHSLIEESQNQQEKNEQE-NH2

993 Ac-EWDREINNYTSLIHSLIEESQNQQEKNEQELLE-NH2

994 Ac-EWDREINNYTSLIHSLIEESQNQQEKNEQELL-NH2

995 Ac-EWDREINNYTSLIHSLIEESQNQQEKNEQEL-NH2

996 Ac-YTKFIYTLLEESQNQQEKNEQELLELDKWASLWNWF-NH2

997 Ac-YMKQLADSLMQLARQVSRLESA-NH2

998 Ac-YLMQLARQMKQLADSLMQLARQVSRLESA-NH2

999 Ac-YQEWERKVDFLEENITALLEEAQIQQEKNMYELQKL-NH2

1000 Ac-WMAWAAAINNYTSLIHSLIEESQNQQEKNEQEEEEE-NH2

1001 Ac-YASLIAALIEESQNQQEKNEQELLELAKWAALWAWF-NH2

1002 [Ac-EWDREINNYTSLIHSLIEESQNQQEKNEQEGGC-NH2] dimer

1003 Ac-YDISIELNKAKSDLEESKEWIKKSNQKLDSIGNWH-NH2

1004 Biotinyl-IDISIELNKAKSDLEESKEWIKKSNQKLDSIGNWH-NH2

1005 Ac-YTSLI-OH

1006 Fmoc-HSLIEE-OH

1007 Fmoc-SQNQQEK-OH

1008 Fmoc-NEQELLEL-OH

1009 Fmoc-DKWASL-OH

1010 Fmoc-WNWF-OH

1011 Ac-AKTLERTWDTLNHLLFISSALYKLNLKSVAQITLSI-NH2

1012 Ac-NITLQAKIKQFINMWQEVGKAMYA-NH2

1013 Ac-LENERTLDFHDSNVKNLYDKVRLQLRDN-NH2

1014 Ac-LENERTLDFHDSNVKNLYDKVRLQLRDNVKELGNG-NH2

1015 Ac-TLDFHDSNVKNLYDKVRLQLRDNVKELGNGAFEF-NH2

1016 Ac-IDISIELNKAKSDLEESKEWIKKSNQKLDSIGNWH-NH2

1021 Biotinyl-SISQVNEEINQALAYIRKADELL-NH2

1022 Biotinyl-SISQVNEEINQSLAYIRKSDELL-NH2

1023 Ac-SISQVNEEINQSLAYIRKSDELL-NH2

1024 Ac-IDISIELNKAKSDLEESKEWIEKSNQELDSIGNWE-NH2

1025 Ac-IDISIELNKAKSDLEESKEWIKKSNQELDSIGNWH-NH2

1026 Ac-IDISIELNKAKSDLEEAKEWIDDANQKLDSIGNWH-NH2

1027 Ac-IDISIELNKAKSDLEESKEWIKKANQKLDSIGNWH-NH2

1028 Ac-IDISIELNKAKSDLEEAKEWIKKSNQKLDSIGNWH-NH2

1029 Biotinyl-NSVALDPIDISIELNKAKSDLEESKEWIKKSNQKL-NH2

1030 Biotinyl-ALDPIDISIELNKAKSDLEESKEWIKKSNQKLDSI-NH2

1031 desAminoTyrosine-NSVALDPIDISIELNKAKSDLEESKEWIKKSNQKL-NH2

1032 desAminoTyrosine-ALDPIDISIELNKAKSDLEESKEWIKKSNQKLDSI-NH2

1033 Ac-YDASISQVNEEINQALAFIRKADEL-NH2

1034 Ac-YDASISQVNEEINQSLAYIRKADELL-NH2

1035 Biotinyl-YDASISQVNEEINQALAYIRKADELL-NH2

1036 Biotinyl-YDASISQVNEEINQSLAFIRKSDELL-NH2

1037 Ac-YDASISQVNEEINQSLAFIRKSDELL-NH2

1038 Ac-WLEWDREINNYTSLIHSLIEESQNQQEKNEQEL-NH2

1039 Biotinyl-IDISIELNKAKSDLEESKEWIRRSNQKLDSIGNWH-NH2

1044 Ac-YESTQKAFDGITNKVNSVIEKTNTQFEAVGKEFGNLEKR-NH2

1045 Biotin-DEYDASISQVNEKINQSLAFIRKSDELL-NH2

1046 Ac-MEWDREINNYTSLIHSLIEESQNQQEKNEQELL-NH2

1047 Ac-WQEWEQKVRYLEANISQSLEQAQIQQEKNMYEL-NH2

1048 Ac-WQEWEQKVRYLEANISQSLEQAQIQQEKNEYEL-NH2

1049 Ac-WQEWEQKVRYLEANITALLEQAQIQQEKNEYEL-NH2

1050 AC - WQEWEQKVRYLEANITALLEQAQ IQQEKNMYEL -NH2 

1051 Ac - WQEWEQKVRYLEANISQSLEQAQIQQEKNEYELQKL-NH2 

1052 Ac - WQEWEQKVRYLEANITALLEQAQ IQQEKNEYELQKL -NH2 

1053 Ac - WQEWEQKVRYLEANITALLEQAQ IQQEKNMYELQKL -NH2 

1054 AC-IDISIELNKAKSDLEESKEWIEKSNQKLDSIGNWH-NH2 

1055 AC - EFGNLEKRLENLNKRVEDGFLDVWTYNAELLVALENE -NH2 

1056 Ac -EDGFLDVWTYNAELLVLMENERTLDFHDSNV1CNLYDKVRMQL-NH2 

1057 Ac - SI SQVNEKINQSLAFIRKSDELL -NH2 
1058 desaminoTyr-SISQVNEKINQSLAFIRKSDELL-NH2

105 9 Ac - S I SQVNEKINQSLAYIRKSDELL -NH2 

1060 Ac - QQL LD WKRQQEMLRLTVWGTKNLQAR VTA I E KYL KDQ-NH2 

1061 YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFC 

1062 Ac - FDASISQVNEKINQSLAYIRKSDELL-NH2 

1063 AC - YTSLIHSLIEESQNQQEKNEQELLELDKWA 

1064 Indole - 3 - acetyl -DEFDAS IS QVNEKINQS LAFIRKS DELL -NH2 

1065 Indol e - 3 -acetyl -DEFDESISQVNEKINQSLAFIRKSDELL -NH2 

1066 Indole -3 -acetyl OEFDESISQVNEKIEQSLAFIRKSDELL-NH2 
10 67 Indole - 3 - acetyl -DEFDESISQVNEKIEESLAFIRKSDELL-NH2 
1068 Indole - 3 - acetyl -DEFDESISQVNEKIEESLQFIRKSDELL-NH2 

106 9 Indole -3 -acetyl -GGGGGDBFDASISQVNEKINQSLAFIRKSDELL-NH2 
1070 2 - Nap t hoy 1 - DEFDAS ISQVNEKINQSLAF IRKSDELL -NH2 

1071 desNH2Tyr-DEFDASISQVNEKINQSLAFIRKSDELL-NH2

1072 Biotin-ALDPIDISIELNKAKSDLEESKEWIRRSNQKLDSI-NH2

1073 Ac - YDAS ISQVNEKINQA1AYIRKADELLHNVNAGKST-NH2 

1074 AC - VYPSDEYDAS ISQVNEKINQALAYIRKADELLHNV-NH2 

1075 AC-VYPSDEYDASISQVNEKINQSLAYIRKSDELJLHNV-NH2 

1076 Ac - WG WGYGYG - NH2 

1077 AC - YG WGWGWGF -NH2 

1078 Ac - WQEWEQKVRYLEANITALQEQAQIQAEKAEYELQKL - NH2 

1079 Ac - WQEWEQKVRYLEAEITALOEEAQIOAEKAEYELQKL -NH2 
1081 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWAS

1082 Ac-VWPSDEFDASISQVNEKINQSLAFIRKSDELLHNV-NH2

1083 Ac - SKNISEQIDQIKKDEQKEGTGWGLGGKWWTSDWGV-NH2 

1084 AC-LSKNISEQIDQIKKDEQKEGTGWGLGGKWWTSDWG-NH2 

1085 AC-DLSKNISEQIDQIKKDEQKEGTGWGLGGKWWTSDW-NH2 

1086 Ac -EDLSKNISEQIDQIXKDEQKEGTGWGLGGKWWTSD-NH2 

1087 Ac-IEDLSKNISEQIDQIKKDEQKEGTGWGLGGKWWTS-NH2
1088 Ac-GIEDLSKNISEQIDQIKKDEQKEGTGWGLGGKWWT-NH2

1089 Ac-IGIEDLSKNISEQIDQIKKDEQKEGTGWGLGGKWW-NH2

1090 2 -Napthoyl - -PSDEFDASISQVNEJCINQS1AFIRKSDELLHNVN-NH2 

1091 AC-VYPSDEYDASISQVNEKINQALAYIRKADELLENV-NH2 

1092 AC-VYPSDEFDASISQVNEKINQALAFIRKADELLENV-NH2 

1093 AC " VYPSDEYDAS IS QVNEKINQALAY IREADELLENV - NH2 

1094 Biotinyl-YDASISQVNEKINQSIAFIRESDELL-NH2 

1095 AC-AIGIEDLSKNISEQIDQIKKDEQKEGTGWGLGGKW-NH2 

1096 AC-AAIGIEDLSKNISEQIDQIKKDEQKEGTGWGLGGK-NH2 
1097 Ac-DAAIGIEDLSKNISEQIDQIKKDEQKEGTGWGLGG-NH2

1098 AC - PDAAIGIEDLSKNISEQIDQIKKDEQKEGTGWGLG-NH2 

1099 AC -NITDKIDQIIHDFVDKTLPDQGDNDNWWTGWRQWI -NH2 

1100 Ac - KNITDKIDQI IHDFVDKTLPDQGDNDNWWTGWRQW-NH2 

1101 Ac -TKNITDKIDQIIHDFVDKTLPDQGDNDNWWTGWRQ-NH2 

1102 AC-WTKNITDKIDQIIHDFVDKTLPDQGDNDNWWTGWR-NH2 

1103 AC - DWTKNITDKIDQ I IHDFVDKTLPDQGDNDNWWTGW-NH2 

1104 AC - HDWTKN ITDKIDQ I IHDFVDKTLPDQGDNDNWWTG -NH2 

1105 AC-PHDWTKNITDKIDQIIHDFVDKTLPDQGDNDNWWT-NH2 

1106 AC-EPHDWTKNITDKIDQIIHDFVDKTLPDQGDNDNWW-NH2 

1107 AC-IEPHDWTKNITDKIDQIIHDFVDKTLPDQGDNDNW-NH2 
110 8 AC -AIEPHDWTKN ITDKIDQ IIHDFVDKTLPDQGDNDN-NH 2 
1109 Ac-AAIEPHDWTKNITDKIDQIIHDFVDKTLPDQGDND-NH2

1110 AC-DAAIEPHDWTKNITDKIDQIIHDFVDKTLPDQGDN-NH2 

1111 Ac - L S PTVWLS V I WMMWYWG P S L YS ILS P FL PLL P IFF - NH2 

1112 AC-GLSPTVWLSVIWMMWYWGPSLYSILSPFLPLLPIF-NH2 

1113 AC-VGLSPTVWLSVIWMMWYWGPSLYSILSPFLPLLPI-NH2 

1114 AC-FVGLSPTWLSVIWMMWYWGPSLYSILSPFLPLLP-NH2 
1115 Ac-WFVGLSPTVWLSVIWMMWYWGPSLYSILSPFLPLL-NH2

1116 Ac -QWFVFLSPTVWLS VIWMMWYWGPSLYS I LSPFLPL-NH2 

1117 Ac -VQWFVGLS PTVWLS VIWMMWYWGPSLYS ILSPFLP -NH2 
1118 Ac-FVQWFVGLSPTVWLSVIWMMWYWGPSLYSILSPFL-NH2

1119 Ac - PF VQWFVGLS PTVWLSVI WMMWYWGPSLYS ILS PF - NH2 

1120 Ac-VPFVQWFVGLSPTVWLSVIWMMWYWGPSLYS ILSP-NH2 

1121 AC-LVPFVQWFVGLSPTVWLSVIWMMWYWGPSLYSILS-NH2 

1122 H -NHTTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKW - OH 

1123 H-QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-OH

1124 Ac - VYPSDEFDAS ISQVNEKINQSLAFIREADELLENV-NH2 

1125 Ac - VFPSDEFDASISQVNEKINQSLAYIREADELLENV-NH2 
112 6 AC-DEFDASISQVNEKINQSLAYIREADELL-NH2 

1127 AC-NEQELLELDKWASLWNWFGGGGDEFDASISQVNEKINQSLAFIRKSDELL-NH2 

112 8 AC - LELDKWASLWNWFGGGGDEFDAS I SQVNEKINQSLAF IRKSDELL -NH2 

112 9 Naphthoyl -EGEGEGEGDEFDAS ISQVNEKINQSLAFIRKSDELL-NH2 

113 0 AC-ASRKCRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDV-NH2 
1131 Naphthoyl-GDEEDASISQVNEKINQSLAFIRKSDELL-NH2

1132 Naphthoyl- GDEEDAS ESQVNEKINQSLAF IRKSDELL -NH2 

113 3 Naphthoyl - GDEEDAS ESQQNEKINQSLAF IRKSDELL -NH2 

1134 Naphthoyl - GDEEDAS ESQQNEKQNQSLAF IRKSDELL -NE2 

113 5 Naphthoyl - GDEEDAS E S QQNEKQNQSEAF IRKS DELL - NH2 

113 6 AC - WGDE FDES ISQVNEKIEESLAFIRKSDELL - NH2 

113 7 AC-YTSLGGDEFDESISQVNEKIEESLAFIRKSDELLGGWNWF-NH2 

113 8 AC-YTSL IHSLGGDEFDES I SQVNEKIEESLAFIRKSDELLGGWASLWNWF - NH 

113 9 2 -Naphthoyl -GDEFDESISQVNEKIEESLAFIRKSDELL-NH2 

1140 2 -Naphthoyl -GDEEDES I SQVNEKIEESLAFIRKSDELL-NH2 

1141 2 -Naphthoyl- GDEEDES ISQVQEKIEESLAFIRKSDELL-NH2 

1142 2 -Naphthoyl -GDEEDES ISQVQEKIEESLLFIRKSDELL-NH2 

114 3 Biot in - GDEYDES ISQVNEKIEESLAFIRKSDELL -NH2 
114 4 2 -Naphthoyl -GDEYDES IS QVNEKIEESLAF IRKSDELL -NH2 
114 5 AC - YTSLIHSLIDEQEKIEELAFIRKSDELLELDKWNWF -NH2 
114 6 VYPSDEYDAS ISQVNEE INQALAYIRKADELLENV -NH2 
1147 AC - NNLLRAIEAQQHLLQLTVWGSKQLQARILAVERYLKDQ -NH2 
114 8 GGGVYPSDEYDAS ISQVNEEINQALAYIRKADELLENV-NH2 

114 9 Ac -NNLLRAIEAQQHLLQLTVWGEKQLQARILAVERYLKDQ-NH2 

1150 Ac - PTRVNYILIIGVLVLAbuEVTGVRADVHLL-NH2 

1151 AC - PTRVNYILI IGVLVLAbuEVTGVRADVHLLEQPGNLW - NH2 

1152 AC - PEKTPLLPTRVNYILI IGVLVLAbuEVTGVRADVHLL -NH2 

1153 Ah aGGGVYPSDEYDAS ISQVNEE INQALAYIRKADELLENV- NH2 

1155 AC-YTSLIHSLGGDEFDESISQVNEKIEESLAFIRKSDELL-NH2 

1156 Ac -YTSLGGDEFDES ISQVNEKIEESLAFIRKSDELL -NH2 

1157 Ac-DEFDESISQVNEKIEESLAFIRKSDELLGGWASLWNWF-NH2
1158 AC-DEFDESISQVNEKIEESLAFIRKSDELLGGWNWF-NH2 

115 9 AC - YTSLIHSLIEESQNQQEKNEQELLELDKASLWNWF -NH2 



1160 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKSLWNWF-NH2

1161 Ac - YTS LIHSLIEESQNQQEKNEQELLELDKLWNWF -NH2 

1162 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWNWF -NH2 

1163 AC-MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKASLWNWF-NH2 

1164 AC-MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKSLWNWF-NH2 

1165 Ac-MTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELI)KLWNWF-NH2 

1166 AC - MTWMEWDRE INNYTSLIHSLI EESQNQQEKNEQELLELDKWNWF -NH2 

1167 AC - MTWMEWDRE INNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWN - NH2 

1168 Ac - MTWMEWDRE 1NNYTSL IHS L IEES QNQQEKNEQELLELDKWASL - NH2 

1169 ( Pyr ) HWS Y ( 2 -nap thyl -D- Ala ) LRPG-NH2 

117 0 AC-WNWFDEFDESISQVNEKIEESLAFIRKSDELLWNWF-NH2 

1171 Ac - YTSLIHSL IEESQNQQEKNEQELLELDKYASLYNYF - NH2 

1172 AC - YTSLIHSLIEESQNQQEKNEQELLELDKYAYLYNYF -NH2 

1173 2 -Naphthoyl -AcaAcaAcaDEFDESISQVNEKIEESLAFIRKSDELLAcaAcaAcaW-NH2 
1174 2-Naphthoyl-AcaAcaAcaGDEFDESISQVNEKIEESLAFIRKSDELLGAcaAcaAcaW-NH2

1175 2 -Naphthoyl -GDEFDES I SQVNEKI EES LAFIRESDELL-NH2 

1176 2 -Naphthoyl - GDEFDES ISQVNEKIEESLAFIEESDELL -NH2 

1177 AC-WQEWEQKVNYLEANITALLEQAQIQQEKNEYELQKL-NH2 

1178 Ac-WQEWEQKVDYLEANITALLEQAQIQQEKNEYEI*QKIi-NH2 

1179 AC-WQEWEQKVRWLEANITALLEQAQIQQEKNEYELQKL-NH2 

1180 AC-WQEWEKQVRYLEANITALLEQAQIQQEKNEYELQKL-NH2 

1181 Ac - WQEWEHQVRYLEANITALLEQAQIQQEKNEYELQKL-NH2 
1182 Ac-WQEWEHKVRYLEANITALLEQAQIQQEKNEYELQKL-NH2

1183 Ac - WQEWDREVRYLEANITALLEQAQIQQEKNEYELQKL -NH2 

1184 Ac - WQEWEREVRYLEANITALLEQAQIQQEKNEYELQKL -NH2 

1185 Ac - WQEWERQVRYLEANITALLEQAQIQQEKNEYELQKL -NH2 

1186 Ac - WQEWEQKVKYLEANITALLEQAQIQQEKNEYELQKL-NH2 

1187 Ac -WQEWEQKVRFLEANITALLEQAQIQQEKNE YELQKL -NB2 

1188 Ac-VNalPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2 
118 9 Ac-VNalPSDENalDASISQVNEEINQALAYIRKADELLENV-NH2 

1190 Ac-VNalPSDEYDASISQVNEEINQALANalIRKADELLENV-NH2

1191 Ac - VYPSDEFDASISQVNEKINQSLAFIREADELLFNFF-NH2 

1192 Ac - VYPSDE YDAS I SQVNEE INQALAYIRKADELLFNFF - RH2 

1193 Ac - YTS L ITALLEQAQ IQQEKNEYELQKLDKWAS LWNWF - NH2 

1194 Ac - YTS LITALLEQAQIQQEKNEYELQKLDKWASLWEWF -NH2 

1195 AC - YTSLITALLEQAQIQQEKNEYELQKLDEWASLWEWF-NH2 

1196 Ac - YTS L ITALLEQAQ IQQEKNE YELQELDE WAS LWE WF - NH2 

1197 Ac - YTS LITALLEEAQ IQQEKNE YELQELDEWASLWEWF-NH2 

1198 Naphthoyl-Aua-Aua-Aua-TALLEQAQIQQEKNEYELQKLAua-Aua-Aua-W-NH2
1199 AC - WAAWEQKVRYLEANITALLEQAQIQQEKNEYELQKL-NH2 
12 00 AC- WQEAAQKVRYLEANITALLEQAQIQQEKNEYELQKL-NH2 
12 01 Ac -WQEWAAKVRYLEAN ITALLEQAQ IQQEKNE YELQKL -NH2 

1202 Ac - WQAAEQKVRYLEANITALLEQAQIQQEKNEYELQKL-NH2 

1203 AC - WQEWEAAVRYLEANITALLEQAQIQQEKNEYELQKL-NH2 
12 04 Ac - WQE WEQAARYLEANITALLEQAQIQQEKNEYELQKL-NH2 
1205 AC - WQEWEQKAAYLEANITALLEQAQIQQEKNE YELQKL -NH2 
12 06 Ac -WQEWEQKVAALEAN ITALLEQAQ IQQEKNE YELQKL -NH2 

1207 AC - WQEWEQKVRYLEANITALLEQAQIQQEKNEYELQKLGGGGWASLWNF-NH2 

1208 2 -Naphthoyl -GDEFDASISQVNEKINQSLAFIRKSDELT-NH2 

1209 2 -Naphthoyl -GDEFDASISQVNEKINQSLAFTRKSDELT-NH2 
1210 2-Naphthoyl-GDEFDASISQVNEKTNQSLAFTRKSDELT-NH2

1211 2 - Naph t hoy 1 - GDEFDAS I S QTNE KTNQS LAFTRKSDELT - NH2 

1212 2 - Naph t hoy 1-GDEFDASTS QTNE KTNQS LAFTRKSDELT -NH2 

1213 2 - Naph t hoy 1-GDE YD ASTSQTNEKTNQS LAFTRKSDELT -NH2 

1214 2 -Naphthoyl-GDEFDEEISQVNEKIEESLAFIRKSDELL-NH2 . 

1215 2 -Naphthoyl -GDEFDASISQVNEKINQSLAFIRKSDELA-NH2 
1216 2-Naphthoyl-GDEFDASASQANEKANQSLAFARKSDELA-NH2

1217 2 - Naph t hoy 1 - GDEFDES I S QVNEKI EE S LAFTRKS DELL - NH2 

1218 2 -Naph thoyl -GDEFDES I SQVNEKTEESLAFIRKS DELL -NH2 

1219 2 - Naph t hoy 1 - GDEFDES I SQTNEKIEESLAFIRKSDELL - NH2 

1220 2 -Naphthoyl -GDEFDESTSQVNEKIEESLAFIRKSDELL-NH2 

1221 Ac - WNWFDEFDESTSQVNEKIEESLAFIRKSDELLWNWF-NH2 

1222 AC-WNWFDEFDESTSQTNEKIEESLAFIRKSDELLWNWF-NH2 

1223 Ac - WNWFDEFDESTSQTNEKTEESLAFIRKSDELLWNWF -NH2 
1224 Ac-LQAGFFLLTRILTIPQSLDSWWTSLNFLGGTTVAL-NH2

122 5 Ac - YTNLIYTLLEESQNQQEKNEQELLELDKWASLWSWF-NH2 

1226 Ac - WQE WEQKVRYLEANITALLEQAQ I QQE KNE YELQKLD KWAS LWNWF - NH2 

1227 AC -NNPHT^QE WEQKVRYLEANITALLEQAQ I QQEIOJEYELQKLDKWAS LWNWF -NH2 

123 0 Ac- WNWFIEESDELLWNWF-NH2 

1231 2 -Naph thoyl -GFIEESDELLW-NH2 

1232 Ac - WFIEESDELLW-NH2 

123 3 2 - Naph thoyl -GFNFFIEESDELLFNFF-NH2 
1234 2-Naphthoyl-GESDELW-NH2

123 5 Ac - WNWFGDEFDESISQVQEEIEESLAF IEESDELLGGWNWF -NH2 
123 6 Ac - WNWFIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

1237 Ac - YTSLITALLEQAQIQQEENEYELQALDEWASLWEWF - NH2 

1238 Ac - YTSLIHSLGGDEFDES 1 SQVNEEIEESLAFIEESDELLGGWASLWNWF -NH2 

123 9 2 - Naph thoyl- GDEFDES ISQVQEE I EESLAFIEESDELL-NH2 

124 0 H - QARQLLSSIMQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ- OH 
1241 Ac - CPKYVKQNTLKLATGMRNVPEKQTR-NH2 

1242 Ac-GLFGAIAGFIENGWEGMIDGWYGFRHQNSC-NH2
124 3 Ac - LNFLGGT - NH2 
124 4 AC - LDS WWTS LNFLGGT -NH2 
124 5 Ac - I LT I PQSLDS WWTSLNFLGGT - NH2 
124 6 Ac -GFFLLTRILTI PQSLDS WWTSLNFLGGT -NH2 
124 7 Ac - WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASLWNWF -NH2 
124 8 Ac - WNWFITALLEQAQIQQEKNEYELQKLDKWASLWNWF -NH2 
124 9 AC-WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 
12 50 Ac - WQEWEQKVRYLEANITALLEQAQIQQEKIEYELQKL-NH2 
12 51 Ac - WQEWEQKVRYLEAQITALLEQAQIQQEKIEYELQKL -NH2 

1252 Ac - KENKANGTDAKVKLIKQELDKYTOJAVTELQLLMQS -NH2 

1253 AC - N I KENKANGTDAKVKL I KQELDK YKNAVTELQLLM -NH2 

1254 (FS) - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF -NH2 

1255 2-Naphthoyl-GWNWFAcaDEFDESISQVQEEIEESLAFIEESDELLAcaWNWF-NH2 

1256 Ac -WNWFGDEFDESISQVNEKIEESLAFIEESDELLGWNWF-NH2 

1257 Ac -WNWFGDEFDESISQVNEKIEESLAFIRKSDELLGWNWF-NH2 

1258 AC ~*^ F ~ Aca ~ DEFDESISQWEKIEES ^ FI ^^ 

1259 Ac - WNWF - Aca -DEFDES IS QVNEKI EESLAFIEESDELL - Aca - WNWF-NH2 

1260 Ac - EESQNQQEKNEQELLELDKWA-NH2 

1261 EESQNQQEKNEQELLELDKWA 
1262 Ac-CGTTDRSGAPTYSWGANDTDVFVLNNTRPPLGNWFG-NH2

1263 Ac - G VEHRLEAACNWTRGERADLEDRDRSELS P - NH2 

1264 Ac - CVREGNASRAWVAVTPTVATRDGKLPT -NH2 

1265 Ac - CFSPRHHWTTQDANAS IYPG -NH2 

1266 Ac-LQHYREVAAAKSSENDRLRLLLKQMCPSLDVDS -NH2 

1267 AC-WQEWDREISNYTSLITALLEQAQIQQEKNEYELQKLDEWASLWEWF-NH2 

1268 AC-CWQEWDREISNYTSLITALLEQAQIQQEKNEYELQKLDEWASLWEWFC-NH2 

1269 AC - WQEWDRE ISNYTSL ITALLEQAQ I QQEKNE YELQKLDEWEWF - NH2 

1270 Ac - CWQEWDREI SNYTS L ITALL EQAQ I QQE KNE YELQKLDE WE WFC - NH2 

1271 AC -GQNSQSPTSNHSPTSAPPTAPGYRWA-NH2 

1272 Ac - PGS STTSTGPARTALTTAQGTSLYPSA -NH2 

1273 Ac -PGSSTTSTGPARTALTTAQGTSLYPSAAATKPSDGNATA-NH2 

1275 AC-WQEWDREITALLEQAQIQQEKNEYELQKLDKWASLWNWF-NH2 

1276 Ac - WQEWDRE I TALLEQAQ I QQE KNE YELQKLDE WAS LWEWF - NH2 
1277 Ac-WQEWDREITALLEQAQIQQEKNEYELQKLDEWEWF-NH2

1278 Ac - WQEWD REITALLEQAQIQQEKNE YELQKLDEWEWF -NH2 

1279 Ac - WQEWERE ITALLEQAQIQQEKNEYELQKL I EWEWF -NH2 
128 0 Ac - WQEWERE ITALLEQAQ IQQEKIEYELQKLDEWEWF-NH2 
12 81 Ac - WQEWEITALLEQAQIQQEKNEYELQKLDEWEWF-NH2 
12 82 Ac - WQEWEITALLEQAQIQQEKNEYELQKLIEWEWF -NH2 
12 83 AC - WQEWE ITALLEQAQ IQQEKIEYELQKLDEWEW F - NH2 
12 84 Ac - WQEWEITALLEQAQIQQEKIEYELQKLIEWEWF-NH2 

1285 Ac-WQEWDREIDEYDASISQVNEKINQALAYIREADELWEWF-NH2
1286 Ac - WQEWERE IDEYDAS XSQVNEKINQALAYIREADELWEWF - NH2 
12 87 AC - WQEWEIDEYDAS IS QVNEKINQALAY IREADELWEWF - NH2 
128 8 Ac - WQEWDRE IDEYDAS IS QVNEEINQALAYI READELWEWF -NH2 

1289 Ac - WQEWERE IDEYDAS I SQVNEEINQALAYI READELWEWF -NH2 

1290 Ac - WQEWE IDEYDAS I SQVNEEINQAIiAYIREADELWEWF - NH2 
12 91 Ac - WQEWDEYDASISQVNEKINQALAYIREADELWEWF -NH2 
12 92 AC - WQEWDEYDAS ISQVNEE INQALAY I READS LWEWF -NH2 

1293 Ac-WQEWEQKITALLEQAQIQQEKIEYELQKLIEWEWF-NH2

1294 AC - WQEWEQKITALLEQAQIQQEKIEYELQKLIEWASLWEWF -NH2 
1 2 95 Ac - WQEWE ITALLEQAQ IQQBKIEYELQKLIEWASLWEWF -NH2 
12 98 - VYPSDEYDAS I SQVNEEINQALAYIRKADELLENV-NH2 

12 99 Ac - WVYPSDEYDASISQVNEEINQALAYIRKADELLENVWNWF -NH2 
1300 YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

13 01 Ac - WQEWDEYDAS ISQVNEKINQALAYIREADELWAWF -NH2 
13 02 AC - WQAWDE YDAS ISQVNEKINQALAYIREADELWAWF - NH2 

1303 Ac-WQAWDEYDASISQVNEKINQALAYIREADELWEWF-NH2

1304 Biotin- YDPLVFPSDEFDAS ISQVNEKINQSLAFIRKSDEL -NH2 

1305 Biotin - YDPLVFPSDEFDAS ISQVNEKINQSLAF -NH2 
13 06 Biotin- QVNEKINQS LAF I RKS DELLHNVNAGKST - NH2 

1307 AC-WMEWDREI-NH2 

1308 Ac - WQEWEQKI -NH2 

13 0 9 Ac - WQEWEQKITALLEQAQIQQEKIEYELQKLIKWASLWEWF-NH2 

1310 Ac - WQEWEQKITALLEQAQ IQQEKIEYELQKLIEWAS LWEWF - NH2 

1311 Ac-WQEWEREISAYTSLITALLEQAQIQQEKIEYELQKLIEWEWF-NH2

1312 Ac - WQEWEREISAYTSLITALLEQAQIQQEKIEYELQKEWEWF -NH2 

1313 Ac - WQEWERE I S AYTSLITALLEQAQ IQQEKIE YELQKEWE W - NH2 

1314 AC-WQEWEREISAYTSLITALLEQAQIQQEKIEYELQKLIEWEW-NH2 
1315 Ac-FNLSDHSESIQKKFQLMKKHVNKIGVDSDPIGSWLR-NH2

1316 AC - DHSES IQKKFQLMKKHVNKIGVDSDP IGSWLRG IF- NH2 

1317 AC - WS VKQANLTTSLLGDLLDDVTS IRHAVLQNRA-NH2 

1318 Biotin-WMEWDREI-NH2 

1319 Biot in-NNMTWMEWDREINNYTSL-NH2 

132 0 Ac -GAASLTLTVQARQLLSGIVQQQNNLLRAIEAQQHLL -NH2 
1321 Ac-ASLTLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQL-NH2

1322 AC - VS VGNTLYYVNKQEGXSLYVKGEPIINFYDPLVF -NH2 

1323 AC-QHWSYGLRPG-NH2 

1324 AC -WQEWEQKIQHWSYGLRPGWASLWEWF-NH2 

1325 AC - WQEWEQKIQHWSYGLRPGWEWF-NH2 

1326 AC - WNWFQHWS YGLRPGWNWF -NH2 

1327 AC-FNFFQHWSYGLRPGFNFF-NH2 

1328 Ac - GAGAQHWS YGLRPGAGAG -NH2 

1329 PLLVLQAGFFLLTRXLTIPQSLDSWWTSLNFLGGT

133 0 AC - WQEWEQKITALLEQAQIQQEKIEYELOKIiAKWASLWEWF -NH2 

1331 Ac -WQEWEQKITALLEQAQIQQEKIEYELQKLAEWASLWEWF-NH2 

1332 Ac -WQEWEQKITALLEQAQIQQEKAEYELQKLAEWASLWEWF-NH2 
13 33 AC - WQEWEQKITALLEQAQIQQEKAEYELQKLAEWASLWAWF -NH2 
1334 AC - WQEWEQKITALLEQAQIQQEKAE YELQKLAKWASLWAWF - NH2 
13 35 Ac -TNKAWSLSNGVSVLTSKVLDLKNYIDKQLLPIVNK-NH2 
1336 AC - KAWS L SNGVS VLTS KVLDLKNYID KQLLP IVNKQS -NH2 

1337 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLIEWEWF-NH2

1338 AC-WQEWEQKITALLEQAQIQQEKNEYELQKLIEWEWF-NH2 

1339 AC-WQEWEQKITALLEQAQIQQEKIEYELQKLDKWEWF-NH2 

1340 AC-YDPLVFPSDEFDASISQVNEKINQSLAF-NH2 

1341 Fluor- -VYPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2 

1342 Fluor - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

1344 AC - SG I VQQQNNLLRAI EAQQHLLQLTVWG I KQLQARIL - NH2 

1345 AC - QQQNNLLRAIEAQQHLLQLTVWG IKQLQARILAVERYLKDQ - NH2 

1346 AC - SGI VQQQNNLLRAI EAQQHLLQLTVWG IKQLQARILAVERYLKDQ -NH2 
13 4 7 Ac - WQEWEQKITALLEQAQIQQEKNEYELQKLAEWASLWAWF -NH2 

134 8 AC-WQEWEQKITALLEQAQIQQEKNEYELQKLAEWASLWAW-NH2 
134 9 Ac - WQEWEQKITALLEQAQIQQEKAEYELQKLAEWASLWAW- NH2 
13 5 0 AC - WQE WEQKITALLEQAQ IQQEKNE YELQKLAEWAGLWAWF -NH2 
1351 AC - WQEWEQKITALLEQAQIQQEKNE YELQKLAEWAGLWAW -NH2 
13 52 AC -WQEWEQKITALLEQAQIQQEKAE YELQKLAEWAGLWAW- NH2 

1353 Ac - WQEWEQKITALLEQAQIQQEKNEYELQKLDKWAGLWEWF -NH2 

1354 AC - WQE WQHWS YGLRPG WEWF - NH2 
13 55 Ac - WQA WQHWS YGLRPGWAWF - NH2 

13 56 Biot inyl - WQEWEQKITALLEQAQIQQEKNE YELQKLDKWASLWEWF-NH2 

13 57 W QE WEQKITALLEQAQ IQQEKNE YE LQKLDKWASL WEWF 

13 5 8 WQEWEQKITALLEQAQIQQEKIEYELQKLIEWEWF 

13 61 AC -AGSTMGARSMTLTVQARQLLSGIVQQQNNLLRAIEAQQ-NH2 

13 62 Ac - AGS AMG AAS LTL S AQS RTLLAG I VQQQQQLLD WKRQQ - NH2 

13 63 Ac -AGSAMGAASTALTAQS RTLLAG I VQQQQQLLD VVKRQQ -NH2 

13 64 Ac - ALTAQSRTLLAG IVQQQQQLLDWKRQQELLRLTVWGT -NH2 

1365 Ac-TLSAQSRTLLAGIVQQQQQLLDWKRQQEMLRLTVWGT-NH2

13 66 Ac - TLTVQARQLLSG I VQQQNNLLRAIEAQQHLLQLTVWG I -NH2 

13 67 AC - WQAWIE YEAELS QVKEKIEQS LAYIREADELWAWF - NH2 
1368 Ac-WQAWIEYEASLSQAKEKIEESKAYIREADELWAWF-NH2

136 9 Ac - WQAWIEYERLLVQAKLKIAIAKLYIAKELLEWAWF-NH2 

1370 AC - WQAWIEYERLLVQVKLKI AIALLYIAKELLE WAWF -NH2 

1371 Ac - WQAWIELERLLVQVKLKLAIAKLEIAKELLEWAWF -NH2 

1372 AC - GEWT YDDATKTFTVTEGGH -NH2 

1373 Ac - WQEWEQKIGEWTYDDATKTFTVTEGGHWASLWEWF -NH2 
1374 Ac-GEWTYDDATKTFTVTE-NH2

1375 AC - WQEWEQKIGEWTYDDATKTFTVTEWASLWEWF -NH2 

1376 AC - MHRFDYRT -NH2 

137 7 Ac - WQEWEQKIMHRFDYRTWASLWEWF-NH2 
13 7 8 AC- MHRFNWS TGGG -NH2 

13 79 AC - WQEWEQK I MHRFNWS TGGG WASL WE W F -NH2 
13 80 Ac - MHRFNWST - NH2 

13 81 AC-WQEWEQKIMHRFNWSTWASLWEWF-NH2 
1382 Ac-LLVPLARIMTMSSVHGGG-NH2

1383 Ac - WQEWEQKILLVPLARIMTMSS VHGGGWASLWEWF -NH2 
13 84 AC-LLVPLARIKTMSSVH-NH2 

13 85 AC- WQEWEQKILLVPLARIMTMSSVHWASLWEWF-NH2 
1386 TALLEQAQIQQEKNEYELQKLDK 
13 87 AC-TALLEQAQIQQEKNEYELQKLDK-NH2 
13 88 Ac - TALLEQAQIQQEKIEYELQKLIE -NH2 
1389 TALLEQAQIQQEKIEYELQKLIE 
1390 Ac-QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERY-NH2

13 91 Rhod-QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERY-NH2 

13 92 Ac - GAASLTLSAQSRTLLAGI VQQQQQLLD WKRQQEML -NH2 

13 93 Ac - G S AMGAASLTLS AQSRTLLAG XVQQQQQLLDWKRQQEML - NH 2 

1394 Ac - PALSTGLIHLHQNIVDVQFLFGVGSSIASWAIKWEY-NH2 

13 95 AC - PALSTGLIHLHQNIVDVQFLYGVGSSIASWAIK-NH2 

1396 AC - LSTTQWQVLPUSFTTLPALSTGLIHLHQNIVDVQY-NH2 

1397 AC - FRKFPEATFSRUGSGPRITPRUMVDFPFRLWHY-NH2 

1398 Ac-DFPFRLWHFPUTINYTIFKVRLFVGGVEHRLEAAUNWTR-NH2

13 99 AC - YVGGVBHRLEAAUNWTRGERUDLEDRDRSELS PL - NH2 

14 00 MVYPSDEYDAS ISQVNEEINQALAYIRKADELLENV 

14 02 Ac - GPLLVLQAGFFLLTRILTI PQSLDS WWTS LNFLGG - NH2 
14 03 Ac - LGPLLVLQAGFFLLTRILTI PQSLDS WWTS LNFLG-NH2 
14 04 Ac - FLGPLLVLQAGFFLLTR I LTI PQSLDS WWTS LNFL-NH2 
14 05 Ac - YTNTIYTLLEESQNQQEKNEQELLELDKWASLWNWF-NH2 
14 06 YTNTIYTLLEESQNQQEKNEQELLELDKWASLWNWF 
1407 Ac-YTGIIYNLLEESQNQQEKNEQELLELDKWANLWNWF-NH2
14 08 YTG I I YNLLEESQNQQEKNEQELLELDKWANLWNWF 

1409 AC-YTSLIYSLLEKSQIQQEKNEQELLELDKWASLWNWF-NH2 

1410 YTSLIYSLLEKSQIQQEKNEQELLELDKWASLWNWF 

1411 AC-EKSQIQQEKNEQELLELDKWA-NH2 

1412 EKSQIQQEKNEQELLELDKWA 

1413 Ac -EQAQIQQEKNEYELQKLDKWA-NH2 

1414 Ac - YTS L IGSLIEESQIQQERNEQELLELDRWASLWEWF -NH2 

1415 Ac - YTXLIHSLIXESQNQQXKNEQELXELDKWASLWNWF-NH2 

1416 Ac - YTXL IHSLI WESQNQQXKNEQELXELD -NH2 

1417 Ac - YTS L I HSL I EES QNQQEKNEQELLELD -NH2 

1418 Ac -WQEQEXKITALLXQAQIQQXKNEYELXKLDKWASLWEWF-NH2 
1419 Ac-XKITALLXQAQIQQXKNEYELXKLDKWASLWEWF-NH2
14 2 0 Ac - WQEWWXKITALLXQAQIQQXKNEYELXKLD-NH2 

14 21 AC-WEQKITALLEQAQIQQEKNEYELQKLD-NH2 

142 2 Ac -WEXKITALLXQAQIQQXKNEYELXKLD-NH2 

14 2 3 Ac - XKITALLXQAQIQQXKNEYELXKLD-NH2 

142 5 Ac - QKITALLEQAQIQQEKNE YELQKLD -NH2 

1426 Ac-QKITALLEQAQIQQEKNEYELQKLDKWASLWEWF-NH2

14 27 Ac - WQEWEQKITALLEQAQ IQQEKNE YELQKLD - NH2 

1428 Ac - VYPSDEYDAS ISQVNEE INQALAYIRKADELLEN-OH 

142 9 Ac -VYPSDEYDAS ISQVNEE INQALAYIRKADELLE -OH 

14 3 0 AC - VYPSDEYDASISQVNEEINQALAYIRKADBLL-OH 

14 31 Ac -VYPSDEYDAS ISQVNEE INQALAYIRKADEL -OH 

14 32 YPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2 

1433 PSDEYDAS I SQVNEEINQALAYIRKADELLENV- NH2 

1434 SDEYDASISQVNEEINQALAYIRKADELLENV-NH2

14 35 DEYDAS ISQVNEE INQALAY IRKADELLENV - NH2 

14 36 AC-VYPSDEYDASISQVDEEINQALAYIRKADELLENV-NH2 

14 37 Ac - VYPSDEYDAS ISQVNEE ID QALAYIRKADELLENV-NH 2 

14 38 AC - VYPSDEYDAS ISQVNEE INQALAY IRKADELLEDV-NH2 

14 3 9 Ac - VYPSDEYDAS IS QVDEEIDQALAY IRKADELLENV -NH2 

144 0 AC - LLSTNKAWSLSNGVS VLTSKVLDLKNYIDKQLLP -NH2 

1441 Ac - LSTNKAWS LSNGVS VLTSKVLDLKNYIDKQLLP I -NH2 

1442 Ac-STNKAWSLSNGVSVGTSKVLDLKNYIDKQLLPIV-NH2

1443 AC-TNKAWSLSNGVSVLTSKVLDLKNYIDKQLLPIVN-NH2 

1444 AC - NKAVVSLSNGVSVLTSKVLDLKNYIDKQLLPIVNK-NH2 
144 5 AC - KAWSLSNGVSVLTSKVLDLKNYIDKQLLPIVNKQ -NH2 
1446 AC -AWSLSNGVSVLTSKVLDLKNYIDKQLLPIVNKQS -NH2 
144 7 AC - WS LSNGVS VLTS KVDLKNY I D KQ WLL P I VNKQSU - NH2 
144 8 Ac - VSLSNGVS VLTSKVLDLKNYIDKQLLP IVNKQSUS -NH2 

144 9 Ac - S LSNGVS VLTSKVLDLKNYIDKQLLP IVNKQSUS I -NH2 
1450 Ac-LSNGVSVLTSKVLDKLKNYIDKQLLPIVNKQSUSIS-NH2

1451 Ac - SNGVSVLTSKVLDLKNYIDKQLLP IVNKQSUS I SN-NH2 

1452 AC - NGVS VLTSKVLDLKNYID KQLLP IVNKQSUS I SNI -NH2 

145 3 AC - GVSVLTS KVLDLKNYIDKQLLP IVNKQSUS ISNIE -NH2 
14 54 AC - VSVLTSKVLDLKNYIDKQLLPIVNKQSUS IS INIET -NH2 
14 55 AC - SVLTSKVLDLKNYIDKQLLPIVNKQSUSISNIETV-NH2 
14 56 Ac - VLTS KVLDLKNYIDKQLLPI VNKQSUS I SNI ETVI -NH2 
1457 AC - LTSKVLDLKNYIDKQLLPIVNKQSUS ISNIETVIE -NH2 
14 58 AC -TSKVLDLKNYIDKQLLPIVKQSUS ISNIETVIEF -NH2 
1459 Ac -SKVLDLKNYIDKQLLPIVNKQSUSISNIETVIEFQ-NH2 
14 60 Ac - KVLDLKNYIDKQLLP IVNKQSUS I SNIETVIEFQQ -NH2 
14 6 1 Ac - VLDLKNYIDKQLLPIVNKQSUS ISNIETVIBFQQK-NH2 
14 62 Ac - LDLKNY IDKQLLP IVNKQSUS I SNIETVIEFQQKN-NH2 
14 6 3 AC -DLKNYIDKQLLPIVNKQSUSISNIETVIEFQQKNN-NH2 
14 64 AC-LKNYIDKQLLPIVNKQSUSISNIETVIEFQQKNNR-NH2 
14 6 5 AC - KNYIDKQLLPIVNKQSUSISNIETVIEFQQKNNRL -NH2 
14 66 AC -NYIDKQLLP IVNKQSUS I SNI ETVIEFQQKNNRLL-NH2 

1467 Ac-YIDKQLLPIVNKQSUSISNIETVIEFQQKNNRLLE-NH2

146 8 AC - IDKQLLP IVNKQSUS ISNIETVIEFQQKNNRLLE I -NH2 
146 9 Ac -DKQLLP IVNKQSUS ISNIETVIEFQQKNNRLLEIT-NH2 
1470 Ac-KQLLPIVNKQSUSISNIETVIEFQQKNNRLLEITR-NH2
1471 Ac-QLLPIVNKQSUSISNIETVIEFQQKNNRLLEITRE-NH2
1472 Ac-VYPSDEYDASISQVNEEINQALA
1473 0 VNEEINQALAYIRKADELLENV-NH2
1474 VYPSDEYDASISQVNEEINQALAYIRKADELLENV
1475 Ac-DEYDASISQVNEEINQALAYIREADEL-NH2
1476 Ac-DEYDASISQVNEKINQALAYIREADEL-NH2
1477 Ac-DDECLNSVKNGTYDFPKFEEESKLNRNEIKGVKLS-NH2
1478 Ac-DDE-Abu-LNSVKNGTYDFPKFEEESKLNRNEIKGVKLS-NH2
1479 Ac-YHKCDDECLNSVKNGTFDFPKFEEESKLNRNEIKGVKLSS-NH2
1480 Ac-YHK-Abu-DDE-Abu-LNSVKNGTFDFPKFEEESKLNRNEIKGVKLSS-NH2
1481 Ac-YTSLIHSLIEESQIQQEKNEQELLELDKWASLWNWF-NH2
1482 Ac-YTSLIHSLIEESQNQQEKNEYELLELDKWASLWNWF-NH2
1483 Ac-YTSLIHSLIEESQIQQEKNEYELLELDKWASLWNWF-NH2
1484 Ac-YTSLIHSLIEESQIQQEKNEYELQKLDKWASLWNWF-NH2
1485 Ac-YTSLIHSLIEESQNQQEKNEQELQKLDKWASLWNWF-NH2
1486 Ac-YTSLIHSLIEESQNQQEKNEYELQKLDKWASLWNWF-NH2
1487 Ac-YTSLIHSLIEESQIQQEKNEQELQKLDKWASLWNWF-NH2
1488 Ac-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWEWF-NH2
1489 Ac-YTSLIHSLIEESQIQQEKNEQELLELDKWASLWEWF-NH2
1490 Ac-YTSLIHSLIEESQNQQEKNEYELLELDKWASLWEWF-NH2
1491 Ac-YTSLIHSLIEESQIQQEKNEYELLELDKWASLWEWF-NH2
1492 Ac-YTSLIHSLIEESQIQQEKNEYELQKLDKWASLWEWF-NH2
1493 Ac-YTSLIHSLIEESQNQQEKNEQELQKLDKWASLWEWF-NH2
1494 Ac-YTSLIHSLIEESQNQQEKNEYELQKLDKWASLWEWF-NH2
1495 Ac-YTSLIHSLIEESQIQQEKNEQELQKLDKWASLWEWF-NH2
1496 Ac-WQEQEQKITALLEQAQIQQEKNEYELQKLDKEWWF-NH2
1497 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLIEWASLWEWF-NH2
1498 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLAKWASLWEWF-NH2
1499 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLIKWASLWEWF-NH2
1500 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLIEWAGLWEWF-NH2
1501 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLAKWAGLWEWF-NH2
1502 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLIKWAGLWEWF-NH2
1503 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLIEWAGLWAWF-NH2
1504 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLAKWAGLWAWK-NH2
1505 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLIKWAGLWAWF-NH2
1506 Ac-WQEWEQKITALLEQAQIQQEKGEYELQKLDKQEQF-NH2
1507 Ac-WQEWEQKITALLEQAQIQQEKGEYELLELDKWEWF-NH2
1508 Ac-WQEWEQKITALLEQAQIQQEKGEYELQKLAKWEWF-NH2
1509 Ac-WQEWEQKITALLEQAQIQQEKGEYELQKLDWQWEF-NH2
1510 Ac-WQEWEQKITALLEQAQIQQEKGEYELLELAKWEWF-NH2
1511 Ac-WEQWEQKITALLEQAQIQQEKNEYELLELDKWEWF-NH2
1512 Ac-WQEWEQKITALLEQAQIQQEKNEYELEEELIEWASLWEWF-NH2
1513 Ac-WQEWEQKITALLEQAQIQQEKNEYELLELIEWAGLWEWF-NH2
1514 Ac-WQEWEQKITALLEQAQIQQEKNEYELLELIEWAGLWAWF-NH2
1515 Ac-WQEWEREITALLEQAQIQQEKNEYELQKLIEWASLWEWF-NH2
1516 Ac-WQEWEREIQQEKNEYELQKLDKWASLWEWF-NH2
1517 Ac-WQEWEREIQQEKGEYELQKLIEWEWF-NH2
1518 Ac-WQEWQAQIQQEKNEYELQKLDKWASLWEWF-NH2
1519 Ac-WQEWQAQIQQEKGEYELQKLIEWEWF-NH2
1520 PEG-GWQEWEQRITALLEQAQIQQERNEYELQRLDEWASLWEWF-NH2

1521 AC-GWQEWEQRITALLEQAQIQQERNEYELQRLDEWASLWEWF-NH2 

1522 PEG- YTSLITAL LEQAQ IQQERNEQELLELDEWASLWEWF - NH2 

1523 AC-YTSLITALLEQAQIQQERNEQELLELDEWASLWEWF-NH2 

1526 PEG - GWQEWEQR ITALLEQAQI QQERNEYELQELDEWASLWEWF -NH2 

1527 Ac -GWQEWEQRITALLEQAQIQQERNEYELQBLDEWASLWEWF-NH2 
1528 PEG-YTSLIGSLIEESQIQQERNEQELLELDRWASLWEWF-NH2

152 9 PEG- GWQEWEQRITALLEQAQIQQERNEYELQRLDRWASLWEWF-NH2 

153 0 Ac -GWQEWEQRITALLEQAQIQQERNEYELQRLDRWASLWEWF-NH2 
1 S 3 1 PEG - GWQEWE QR ITALLEQAQ IQQERNEYELQELDRWASLWEWF - NH2 

1532 AC -GWQEWEQR ITALLEQAQ I QQERNEYBLQELDRWAS LWEWF -NH 2 

1533 PEG-YTSLIGSLIEESQNQQERNEQELLELDRWASLWNWF-NH2 

1534 AC - YTSLIGSLIEESQNQQERNEQELLELDRWASLWNWF-NH2 

153 8 Ac-YTSLIHSLIEESQNQQEK-OH 
1539 NEQELLELDK

1540 WASLWNWF -NH2 *' 

154 2 AC - AAAWEQK2TALLEQAQIQQEKNEYELQKLDKWASLWEWF -NH2 
154 3 Ac - WQEAAAKITALLEQAQIQQEKNEYELQKLDKWASLWEWF -NH2 
154 4 AC-WQEWEQAAAALLEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 

1545 Ac -WQEWEQKITAAAEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 

1546 AC - WQEWE QKITALLAAAQ I QQEKNE YELQKLD KWAS LWEWF - NH2 

1547 AC - WQEWEQKITALLEQAAAAQEKNE YELQKLDKWASLWE WF - NH2 
1548 Ac-WQEWEQKITALLEQAQIQAAANEYELQKLDKWASLWEWF-NH2

154 9 AC-WQEWEQKITALLEQAQIQQEKAAAELQKLDKWASLWEWF-NH2 

1550 AC -WQEWEQK ITALLEQAQI QQEKNE YAAAKLD KWAS LWEWF -NH2 

1551 Ac -WQEWEQK ITALLEQAQ I QQEKNEYELQAAAKWAS LWEWF -NH2 

1552 AC - WQEWEQKITALLEQAQIQQEKNEYELQKLDAAASLWEWF -NH 

1553 Ac-WQEWEQKITALLEQAQIQQEKNEYELQKLDXWAAAAEWF-NH 

1554 Ac -WQEWEQKITALLEQAQIQQEKNE YELQKLD KWASLWAAA-NH 
1556 Ac - YTSL IHSLIEESQNQQEKNEQELLLDKWASLWNWF -NH2 

1557 Ac-YTSLIHSLIEESQNQEKNEQELLELDKWASLWNWF-NH2

1558 AC - ERTLDFHDS -NH2 

1559 Ac- YTSLIHSLIEESQNQQEKNEQELLELDKWASLWN (W) F-NH2 

1563 Ac- YTSLIHSLIEESQN (Q) QEKNEQELLELDKWASLWNWF-NH2 

1564 Ac- YTSLIHSLIEESQNQQDKWASLWNWF-NH2 

1566 AC - FYEIIMDIEQNNVQGKKGIQQLQKWEDWVGWIGNI -NH2 

1567 Ac - INQTIWNHGNITLGEWYNQTKDLQQKFYEIIMDIE-NH2 

1568 AC-WNHGNITLGEWYNQTKDLQQKFYEIIMDIEQNNVQ-NH2 
1572 Ac-YTSLIHSLIEESENQQEKNEQELLELDKWASLWNWF-NH2

1573 AC-YTSLIHSLIEESQDQQEKNEQELLELDKWASLWNWF-NH2 

1574 AC-YTSLIHSLIEESQNEQEKNEQELLELDKWASLWNWF-NH2 

1575 C-YTSLIHSLIEESQNQEEKNEQELLELDKWASLWNWF-NH2 

1576 AC - YTSLIHSLIEESQNQQEKDEQELLELDKWASLWNWF-NH2 

1577 AC - LGEWYNQTKDLQQKF YE HMD I EQNNVQGKKG IQQ-NH2 

1578 AC-WYNQTKDLQQKFYEIIMDIEQNNVQGKKGIQQLQK-NH2 

157 9 AC-YTSLIHSLIEESQNQQEKNEEELLELDKWASLWNWF-NH2 
1 S 8 0 AC-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWDWF-NH2 

1586 Ac-XTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWX-NH2

158 8 Ac - YNQTKDLQQKFYEI IMDIEQNNVQGKXGIQQLQKW-NH2 
1598 Ac - YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 
1600 Ac-TLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQAR-NH2
1603 Ac - LQQKFYEI IMDIEQNNVQGKKGIQQLQKWEDWVGW -NH2 
162 7 AC-YTSLIHSLIEESQNQQEKNEQEIiIALDKWASLWNWF-NH2 
162 8 AC-YTSLIHSLIEESQNQQEKNEQELLEADKWASLWNWF-NH2 

162 9 Ac- YTSLIHSLIEESQNQQEKNEQELLEIJUCWASLWNWF-NH2 

163 0 AC-YTSLIHSLIEESQNQQEKAEQELLELDKWASLWNWF-NH2 
1631 Ac-YTSLIHSLIEESQNQQEKNAQELLELDKWASLWNWF-NH2

163 2 Ac - YTSL1HSLIEESQNQQEKNEAELLELDKWASLWNWF-NH2 
1634 Ac- WQEWEQKITALLEQAQIQQEKNEQELQKLDKWASLWEWF -NH2 

163 5 AC - WQEWEQKITALLEQAQIQQEKAEYELQKLDKWASLWEWF-NH2 

1636 AC - WQEWEQK ITALLEQAQ I QQE KNAYELQKLD KWAS LWE WF - NH2 

1637 Ac - WQEWEQKITALLEQAQIQQEKNEAELQKLDKWASLWEWF-NH2 
1644 AC-EYDLRRWEK-NH2 

164 5 Ac - EQELLELDK -NH2 
1646 Ac-EYELQKLDK-NH2

1647 AC-WQEWEQKITALLEQAQIQQEKNEQELLKLDKWASLWEWF-NH2 
164 8 Ac - WQEWEQKITALLEQAQIQQEKNEQELLELDKWASLWEWF -NH2 
164 9 Ac - WQEWEQKITALLEQAQIQQEKNDKWASLWEWF -NH2 

1650 AC-YTSLIHSLIEESQNQAEKNEQELLELDKWASLWNWF-NH2 

1651 AC-YTSLIHSLIEESQNQ0AKNEQELLELDKWASLWNWF-NH2 

1652 AC - YTSLIHSLIEESQNQQEANEQELLELDKWASLWNWF-NH2 

1653 AC - YTSLIHSLIEESANQQEANEQELLELDKWASLWNWF - NH2 
1654 Ac-YTSLIHSLIEESQAQQEKNEQELLELDKWASLWNWF-NH2

1655 AC - YTSLIHSLIEESQKAQEKNEQELLELDKWASLWNWF - NH2 

1656 Ac - YTSLIHALIEESQNQQEKNEQELLELDKWASLWNWF -NH2 

1657 AC - YTSLIHSAIEESQNQQEKNEQELLELDKWASLWNWF - NH2 

1658 Ac -VYPSDEYDASISQVNEEINQALAYIRKADELLENV-NH2 

1659 AC - YTSIilHSLAEESQNQQEKNEQELIiELDKWASLWNWF - NH2 

1660 Ac - YTSAIHSLIEESQNQQEKNEQELLELDKWASLWNWF - NH2 

1661 Ac - YTSliAHSLIEESQNQQEKNEQELtiELDKWASLWNWF - NH2 
1662 Ac-YTSLIASLIEESQNQQEKNEQELLELDKWASLWNWF-NH2

1663 AC-ATSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-NH2 

1664 Ac - YASLIHSLIEESQNQQEKNEQELLELDKWASLWNWF -NH2 
16 65 AC - YTALIHSLIEESQNQQEKNEQELLELDKWASLWNWF -NH2 

1666 Ac- RIQDLEKYVEDTKIDLWS YNAELLVALENQ -NH2 

1667 Ac -HTIDLTDSEMNKLFEKTRRQLREN -NH2 

166 8 Ac-SEMNKLFEKTRRQLREN -NH2 

1669 AC-VFPSDEADASISQVNEKINQSLAFIRKSDELLHNV-NH2 

167 0 AC - VFPSDEFAAS ISQVNEKINQSLAFIRKSDELLHNV-NH2 

1671 Ac-VFPSDEFDASISAVNEKINQSIiAFIRKSDELLHNV-NH2 

1672 Ac - VFPSDEFDAS I S QANEKINQS LAF I RKS DELLHNV - NH2 

1673 Ac- VFPSDEFDAS ISQVAEKINQSLAFIRKSDELLHNV-NH2 

1674 AC - WQEWEQKITAALEQAQIQQEKNEYELQKLDKWASLWEWF-NH2 

1675 AC-WQEWEQKITALAEQAQIQQEKNEYELQKLDKWASI*WEWF-NH2 

1676 AC - WQEWEQKITALLEQAAlQQEKNEYELQKIiDKWASLWEWF -NH2 

1677 Ac - WQEWEQKITALLEQAQAQQEKNE YELQKLDKWASLWEWF - NH2 

1678 AC - WQEWEQKITALLEQAQIAQEKNEYELQKLDKWASLWEWF-NH2 
1679 Ac-WQEWEQKITALLEQAQIQAEKNEYELQKLDKWASLWEWF-NH2

16 8 0 Ac - VFPSDEFDAS ISQVNEKINQSAAFIRKSDELLHNV-NH2 
1681 Ac - VFPSDEFDAS ISQVNEKINQSLAAIRKSDELLHNV -NH2 
T No. Sequence

16 82 Ac - VFPSDEFDASISQVNEKINQSLAFIRKSDEALHNV-NH2 

168 3 Ac - VFPSDEFDASISQVNEKINQSLAFIRKSDELAHNV-NH2 

16 84 Ac - VFPSDEFDAS ISQVNEKINQSLAFIRKSDELLANV-NH2 

1685 Ac - WQEWEQKITALLEQAQIQQAKNEYELQKLDKWASLWEWF-NH2 

1687 Ac - WQEWEQKITALLEQAQIQQEKNEYELQALDKWASLWEWF - NH2 

168 8 AC - WQEWEQKITALLEQAQIQQEKNEYELQKADKWASLWEWF -NH2 
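
Several series in the table above appear to follow two systematic designs: a window of fixed length stepped one residue at a time along a longer parent sequence (for example, entries 1099 to 1110), and single alanine substitutions within a parent peptide (for example, entries 1685, 1687 and 1688). The following Python sketch shows how such series could be enumerated. It is offered only as an illustration; the function names are hypothetical, and the two parent sequences were reassembled here from the table entries themselves rather than taken from any statement in the text.

```python
from typing import List, Tuple

# Parent reassembled from overlapping entries 1099 to 1110 (an assumption).
WALK_PARENT = "DAAIEPHDWTKNITDKIDQIIHDFVDKTLPDQGDNDNWWTGWRQWI"
# Unsubstituted parent of entries 1685, 1687 and 1688 (an assumption).
SCAN_PARENT = "WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASLWEWF"


def sliding_windows(parent: str, width: int) -> List[str]:
    """Return every contiguous fragment of `parent` that is `width` residues long."""
    if width <= 0 or width > len(parent):
        raise ValueError("width must be between 1 and len(parent)")
    return [parent[i:i + width] for i in range(len(parent) - width + 1)]


def alanine_scan(parent: str) -> List[Tuple[int, str]]:
    """Return (1-based position, variant) pairs, one per non-alanine residue."""
    variants = []
    for i, residue in enumerate(parent):
        if residue == "A":
            continue  # position is already alanine, nothing to substitute
        variants.append((i + 1, parent[:i] + "A" + parent[i + 1:]))
    return variants


if __name__ == "__main__":
    for fragment in sliding_windows(WALK_PARENT, 35):
        print("Ac-" + fragment + "-NH2")
    for position, variant in alanine_scan(SCAN_PARENT):
        print(position, "Ac-" + variant + "-NH2")
```

Under these assumptions, the first and last 35-residue windows correspond to the sequences listed for entries 1110 and 1099, and the variants at positions 20, 28 and 29 correspond to entries 1685, 1687 and 1688.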

5.4. SYNTHESIS OF PEPTIDES
The peptides of the invention may be synthesized
or prepared by techniques well known in the art. See,
for example, Creighton, 1983, Proteins: Structures
and Molecular Principles, W.H. Freeman and Co., NY,
which is incorporated herein by reference in its
entirety. Short peptides, for example, can be
synthesized on a solid support or in solution. Longer
peptides may be made using recombinant DNA techniques.
Here, the nucleotide sequences encoding the peptides
of the invention may be synthesized, and/or cloned,
and expressed according to techniques well known to
those of ordinary skill in the art. See, for example,
Sambrook, et al., 1989, Molecular Cloning, A
Laboratory Manual, Vols. 1-3, Cold Spring Harbor
Press, NY.
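
As a minimal illustration of the recombinant route, a nucleotide sequence encoding a given peptide can be derived by back-translation with the standard genetic code. The Python sketch below picks one codon per amino acid; the particular codon chosen for each residue, the function names, and the added start and stop codons are assumptions made for illustration only. Chemical terminal groups such as acetyl or amide cannot be encoded this way, so only the amino acid sequence is handled.

```python
# One codon per amino acid, taken from the standard genetic code.
# Which synonymous codon is used for each residue is an arbitrary choice here.
CODON = {
    "A": "GCC", "C": "TGC", "D": "GAC", "E": "GAG", "F": "TTC",
    "G": "GGC", "H": "CAC", "I": "ATC", "K": "AAG", "L": "CTG",
    "M": "ATG", "N": "AAC", "P": "CCC", "Q": "CAG", "R": "CGC",
    "S": "AGC", "T": "ACC", "V": "GTG", "W": "TGG", "Y": "TAC",
}
START, STOP = "ATG", "TAA"


def back_translate(peptide: str) -> str:
    """Return a DNA coding sequence (start codon, peptide codons, stop codon)."""
    try:
        body = "".join(CODON[residue] for residue in peptide)
    except KeyError as exc:
        raise ValueError(f"no codon for residue {exc.args[0]!r}") from None
    return START + body + STOP


def translate(dna: str) -> str:
    """Translate a sequence produced by back_translate, stopping at STOP."""
    inverse = {codon: residue for residue, codon in CODON.items()}
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if codon == STOP:
            break
        residues.append(inverse[codon])
    return "".join(residues)


if __name__ == "__main__":
    coding = back_translate("WMEWDREI")  # core sequence of entry 1307
    print(coding)
    print(translate(coding))  # includes the initiator methionine
```

Because the start codon also encodes methionine, the translated product carries one extra amino-terminal residue relative to the input peptide.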


The peptides of the invention may alternatively
be synthesized such that one or more of the bonds
which link the amino acid residues of the peptides are
non-peptide bonds. These alternative non-peptide
bonds may be formed by utilizing reactions well known
to those in the art, and may include, but are not
limited to, imino, ester, hydrazide, semicarbazide, and
azo bonds, to name but a few. In yet another
embodiment of the invention, peptides comprising the
sequences described above may be synthesized with
additional chemical groups present at their amino
and/or carboxy termini, such that, for example, the
stability, bioavailability, and/or inhibitory activity
of the peptides is enhanced. For example, hydrophobic
groups such as carbobenzoxyl, dansyl, or
t-butyloxycarbonyl groups may be added to the peptides'
amino termini. Likewise, an acetyl group or a
9-fluorenylmethoxy-carbonyl group may be placed at the
peptides' amino termini. (See "X" in Tables I to IV,
above.) Additionally, the hydrophobic group
t-butyloxycarbonyl, or an amido group, may be added to
the peptides' carboxy termini. (See "Z" in Tables I
to IV, above.)

Further, the peptides of the invention may be
synthesized such that their steric configuration is
altered. For example, the D-isomer of one or more of
the amino acid residues of the peptide may be used,
rather than the usual L-isomer.

Still further, at least one of the amino acid
residues of the peptides of the invention may be
substituted by one of the well known non-naturally
occurring amino acid residues. Alterations such as
these may serve to increase the stability,
bioavailability and/or inhibitory action of the
peptides of the invention.

Any of the peptides described above may,
additionally, have a macromolecular carrier group
covalently attached to their amino and/or carboxy
termini. Such macromolecular carrier groups may
include, for example, lipid-fatty acid conjugates,
polyethylene glycol, carbohydrates or additional
peptides. "X", in Tables I to IV, above, may
therefore additionally represent any of the above
macromolecular carrier groups covalently attached to
the amino terminus of a peptide, with an additional
peptide group being preferred. Likewise, "Z", in
Tables I to IV, may additionally represent any of the
macromolecular carrier groups described above.

5.5. ASSAYS FOR ANTI-MEMBRANE FUSION ACTIVITY

Described herein are methods for testing the ability
of a compound, such as the peptides of the invention, to
inhibit membrane fusion events. Specifically, assays
for cell fusion events are described in Section 5.5.1,
below, and assays for antiviral activity are described
in Section 5.5.2, below.

5.5.1. ASSAYS FOR CELL FUSION EVENTS

Assays for cell fusion events are well known to
those of skill in the art, and may be used in
conjunction, for example, with the peptides of the
invention to test the peptides' antifusogenic
capabilities.

Cell fusion assays are generally performed in
vitro. Such an assay may comprise culturing cells
which, in the absence of any treatment, would undergo
an observable level of syncytial formation. For
example, uninfected cells may be incubated in the
presence of cells chronically infected with a virus
that induces cell fusion. Such viruses may include,
but are not limited to, HIV, SIV, or respiratory
syncytial virus.

For the assay, cells are incubated in the
presence of a peptide to be assayed. For each
peptide, a range of peptide concentrations may be
tested. This range should include a control culture
wherein no peptide has been added.

Standard conditions for culturing cells, well
known to those of ordinary skill in the art, are used.
After incubation for an appropriate period (24 hours
at 37°C, for example) the culture is examined
microscopically for the presence of multinucleated
giant cells, which are indicative of cell fusion and
syncytial formation. Well known stains, such as
crystal violet stain, may be used to facilitate the
visualization of syncytial formation.
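
The readout of such an assay can be summarized numerically. The following is a minimal sketch and not part of the disclosed method: it assumes that syncytia have been counted per well, expresses each count as percent inhibition relative to the no-peptide control culture, and estimates by log-linear interpolation the peptide concentration giving 50% inhibition. The function names, the units, and the counts are hypothetical and chosen only for illustration.

```python
import math


def percent_inhibition(control_count, treated_count):
    """Percent reduction in syncytia relative to the no-peptide control."""
    if control_count <= 0:
        raise ValueError("control culture must show syncytia")
    return 100.0 * (1.0 - treated_count / control_count)


def interpolate_ic50(concentrations, inhibitions):
    """Estimate the concentration giving 50% inhibition.

    Interpolates linearly in log10(concentration) between the two adjacent
    tested concentrations that bracket 50%. Returns None if no pair of
    adjacent concentrations brackets 50%.
    """
    points = sorted(zip(concentrations, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= 50.0 <= i_hi and i_hi > i_lo:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_c
    return None


# Hypothetical data: syncytia per well at each peptide concentration (ng/ml).
control = 80  # no-peptide control culture
counts = {1.0: 72, 10.0: 60, 100.0: 20, 1000.0: 0}
inhib = {c: percent_inhibition(control, n) for c, n in counts.items()}
ic50 = interpolate_ic50(list(inhib), list(inhib.values()))
```

Interpolating on a logarithmic concentration axis is a common convention for serial dilutions; a fitted sigmoidal curve could be substituted where more data points are available.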

5.5.2. ASSAYS FOR ANTIVIRAL ACTIVITY
The antiviral activity exhibited by the peptides
of the invention may be measured, for example, by
easily performed in vitro assays, such as those
described below, which can test the peptides' ability
to inhibit syncytia formation, or their ability to
inhibit infection by cell-free virus. Using these
assays, such parameters as the relative antiviral
activity that the peptides exhibit against a given
strain of virus and/or the strain-specific inhibitory
activity of the peptide can be determined.

A cell fusion assay may be utilized to test the
peptides' ability to inhibit viral-induced, such as
HIV-induced, syncytia formation in vitro. Such an
assay may comprise culturing uninfected cells in the
presence of cells chronically infected with a
syncytial-inducing virus and a peptide to be assayed.
For each peptide, a range of peptide concentrations
may be tested. This range should include a control
culture wherein no peptide has been added. Standard
conditions for culturing, well known to those of
ordinary skill in the art, are used. After incubation
for an appropriate period (24 hours at 37°C, for
example) the culture is examined microscopically for
the presence of multinucleated giant cells, which are
indicative of cell fusion and syncytia formation.
Well known stains, such as crystal violet stain, may
be used to facilitate syncytial visualization. Taking
HIV as an example, such an assay would comprise CD-4+
cells (such as Molt or CEM cells, for example)
cultured in the presence of chronically HIV-infected
cells and a peptide to be assayed.

Other well known characteristics of viral
infection may also be assayed to test a peptide's
antiviral capabilities. Once again taking HIV as an
example, a reverse transcriptase (RT) assay may be
utilized to test the peptides' ability to inhibit
infection of CD-4+ cells by cell-free HIV. Such an
assay may comprise culturing an appropriate
concentration (i.e., TCID50) of virus and CD-4+ cells
in the presence of the peptide to be tested. Culture
conditions well known to those in the art are used.
As above, a range of peptide concentrations may be
used, in addition to a control culture wherein no
peptide has been added. After incubation for an
appropriate period (e.g., 7 days) of culturing, a
cell-free supernatant is prepared, using standard
procedures, and tested for the presence of RT activity
as a measure of successful infection. The RT activity
may be tested using standard techniques such as those
described by, for example, Goff et al. (Goff, S. et
al., 1981, J. Virol. 38:239-248) and/or Willey et al.
(Willey, R. et al., 1988, J. Virol. 62:139-147).
These references are incorporated herein by reference
in their entirety.

Standard methods which are well-known to those of
skill in the art may be utilized for assaying
non-retroviral activity. See, for example, Pringle et al.
(Pringle, C.R. et al., 1985, J. Medical Virology
17:377-386) for a discussion of respiratory syncytial
virus and parainfluenza virus activity assay
techniques. Further, see, for example, "Zinsser
Microbiology", 1988, Joklik, W.K. et al., eds.,
Appleton & Lange, Norwalk, CT, 19th ed., for a general
review of such techniques. These references are
incorporated by reference herein in their entirety.
In addition, the Examples presented below, in Sections
17, 18, 26 and 27 each provide additional assays for
the testing of a compound's antiviral capability.

In vivo assays may also be utilized to test, for
example, the antiviral activity of the peptides of the
invention. To test for anti-HIV activity, for
example, the in vivo model described in Barnett et al.
(Barnett, S.W. et al., 1994, Science 266:642-646) may
be used.

Additionally, anti-RSV activity can be assayed in
vivo via well known mouse models. For example, RSV
can be administered intranasally to mice of various
inbred strains. Virus replicates in lungs of all
strains, but the highest titers are obtained in P/N,
C57L/N and DBA/2N mice. Infection of BALB/c mice
produces an asymptomatic bronchiolitis characterized
by lymphocytic infiltrates and pulmonary virus titers
of 10^4 to 10^5 pfu/g of lung tissue (Taylor, G. et al.,
1984, Infect. Immun. 43:649-655).

Cotton rat models of RSV are also well known. 
Virus replicates to high titer in the nose and lungs 
of the cotton rat but produces few if any signs of 
inflammation. 

5.6. USES OF THE PEPTIDES OF THE INVENTION 
The peptides of the invention may be utilized as
antifusogenic or antiviral compounds, or as compounds
which modulate intracellular processes involving
coiled coil peptide structures. Further, such
peptides may be used to identify agents which exhibit
antifusogenic, antiviral or intracellular modulatory
activity. Still further, the peptides of the
invention may be utilized as organism or viral
type/subtype-specific diagnostic tools.

The antifusogenic capability of the peptides of
the invention may additionally be utilized to inhibit
or treat/ameliorate symptoms caused by processes
involving membrane fusion events. Such events may
include, for example, virus transmission via cell-cell
fusion, abnormal neurotransmitter exchange via
cell-fusion, and sperm-egg fusion. Further, the peptides
of the invention may be used to inhibit free viral,
such as retroviral, particularly HIV, transmission to
uninfected cells wherein such viral infection involves
membrane fusion events or involves fusion of a viral
structure with a cell membrane. Among the
intracellular disorders involving coiled coil peptide
structures which may be ameliorated by the peptides of
the invention are disorders involving, for example,
bacterial toxins.

With respect to antiviral activity, the viruses
whose transmission may be inhibited by the peptides of
the invention include, but are not limited to, human
retroviruses, such as HIV-1 and HIV-2 and the human
T-lymphocyte viruses (HTLV-I and II), and non-human
retroviruses such as bovine leukosis virus, feline
sarcoma and leukemia viruses, simian immunodeficiency,
sarcoma and leukemia viruses, and sheep progressive
pneumonia viruses.

Non-retroviral viruses whose transmission may be
inhibited by the peptides of the invention include,
but are not limited to, human respiratory syncytial
virus, canine distemper virus, Newcastle disease
virus, human parainfluenza virus, influenza viruses,
measles viruses, Epstein-Barr viruses, hepatitis B
viruses, and simian Mason-Pfizer viruses.

Non-enveloped viruses whose transmission may be
inhibited by the peptides of the invention include,
but are not limited to, picornaviruses such as polio
viruses, hepatitis A virus, enterovirus, echoviruses
and coxsackie viruses, papovaviruses such as papilloma
virus, parvoviruses, adenoviruses and reoviruses.

As discussed more fully, below, in Section 5.6.1
and in the Example presented, below, in Section 8,
DP107, DP178, DP107 analog and DP178 analog peptides
form non-covalent protein-protein interactions which
are required for normal activity of the virus. Thus,
the peptides of the invention may also be utilized as
components in assays for the identification of
compounds that interfere with such protein-protein
interactions and may, therefore, act as antiviral
agents. These assays are discussed, below, in Section
5.6.1.

As demonstrated in the Example presented below in
Section 6, the antiviral activity of the peptides of
the invention may show a pronounced type and subtype
specificity, i.e., specific peptides may be effective
in inhibiting the activity of only specific viruses.
This feature of the invention presents many
advantages. One such advantage, for example, lies in
the field of diagnostics, wherein one can use the
antiviral specificity of the peptide of the invention
to ascertain the identity of a viral isolate. With
respect to HIV, one may easily determine whether a
viral isolate consists of an HIV-1 or HIV-2 strain.
For example, uninfected CD-4+ cells may be co-infected
with an isolate which has been identified as
containing HIV and the DP178 (SEQ ID NO:1) peptide, after
which the retroviral activity of cell supernatants may
be assayed, using, for example, the techniques
described above in Section 5.2. Those isolates whose
retroviral activity is completely or nearly completely
inhibited contain HIV-1. Those isolates whose viral
activity is unchanged or only reduced by a small
amount may be considered to not contain HIV-1. Such
an isolate may then be treated with one or more of the
other DP178 peptides of the invention, and
subsequently be tested for its viral activity in order
to determine the identity of the viral isolate. The
DP107 and DP178 analogs of the invention may also be
utilized in a diagnostic capacity specific to the type
and subtype of virus or organism in which the specific
peptide sequence is found. A diagnostic procedure as
described, above, for DP178, may be used in
conjunction with the DP107/DP178 analog of interest.

5.6.1. SCREENING ASSAYS 
As demonstrated in the Example presented in
Section 8, below, DP107 and DP178 portions of the TM
protein gp41, i.e., the HR1 and HR2 portions of gp41,
respectively, form non-covalent protein-protein
interactions. As is also demonstrated, the
maintenance of such interactions is necessary for
normal viral infectivity. Thus, compounds which bind
DP107, bind DP178, and/or act to disrupt normal
DP107/DP178 protein-protein interactions may act as
antifusogenic, antiviral or cellular modulatory
agents. Described below are assays for the
identification of such compounds. Note that, while,
for ease and clarity of discussion, DP107 and DP178
peptides will be used as components of the assays
described, it is to be understood that any of the
DP107 analog or DP178 analog peptides described,
above, in Sections 5.1 through 5.3 may also be
utilized as part of these screens for compounds.

For example, in certain embodiments the assays of
the invention may use DP107 and/or DP178 analogs
that contain one or more amino acid residue
truncations, deletions, insertions or substitutions.
In particular, in one preferred embodiment, the DP107,
DP178, DP107-like and DP178-like peptides can comprise
amino and/or carboxy-terminal insertions corresponding
to about two to about fifty amino acids amino-to or
carboxy-to the endogenous sequence from which the
DP107, DP178, DP107-like or DP178-like peptide is
derived. In another particular embodiment, the
peptides used in the assays described herein further
comprise additional, heterologous sequence useful for
detecting, immobilizing and/or purifying the
particular peptide. Such heterologous sequences
include, but are not limited to, maltose binding fusion
proteins containing a DP178, DP107, DP178-like or
DP107-like sequence such as the M41Δ178 and MF5.1
maltose binding fusion proteins described in Sections
8 and 30, below.

In certain embodiments, such analogs will have
reduced binding affinities and are therefore useful,
e.g., to screen for compounds which inhibit the
formation of or, alternatively, disrupt
DP107/DP178 complexes. Among such reduced
binding analogs are peptides exhibiting one or more
alanine insertions or substitutions, including, e.g.,
the peptides described in the examples presented in
Sections 30 and 31, below. It is understood that such
analogs which have reduced binding affinities,
including the analogs described in Sections 30 and 31
below, are also part of the present invention.
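
Candidate alanine-substituted analogs of this kind can be enumerated mechanically. The sketch below is illustrative only and is not the procedure of Sections 30 and 31: it generates every single-residue alanine substitution of a parent sequence. The function name is hypothetical, and the parent sequence shown is an assumption inferred from the substituted peptides listed in the tables above, with the terminal Ac- and -NH2 groups omitted.

```python
def alanine_scan(parent):
    """Return {1-based position: variant} for each single X-to-Ala substitution.

    Positions that are already alanine are skipped, since substituting them
    would simply reproduce the parent sequence.
    """
    variants = {}
    for i, residue in enumerate(parent):
        if residue == "A":
            continue
        variants[i + 1] = parent[:i] + "A" + parent[i + 1:]
    return variants


# Assumed parent sequence (inferred from the tabulated analogs, not stated
# explicitly in this section).
PARENT = "WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASLWEWF"
scan = alanine_scan(PARENT)
```

Under this assumption, the variant at position 19 has the same residue sequence as the peptide listed as No. 1679. Whether a given variant actually shows reduced binding affinity has to be determined experimentally.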

Compounds which may be tested for an ability to
bind DP107, DP178, and/or disrupt DP107/DP178
interactions, and which therefore, potentially
represent antifusogenic, antiviral or intracellular
modulatory compounds, include, but are not limited to,
peptides made of D- and/or L-configuration amino acids
(in, for example, the form of random peptide
libraries; see Lam, K.S. et al., 1991, Nature
354:82-84), phosphopeptides (in, for example, the form of
random or partially degenerate, directed
phosphopeptide libraries; see, for example, Songyang,
Z. et al., 1993, Cell 72:767-778), antibodies, and
small organic or inorganic molecules. Synthetic
compounds, natural products, and other sources of
potentially effective materials may be screened in a
variety of ways, as described in this Section.

Compounds that can be screened, tested and 
identified as modulating HR1/HR2, DP178/DP107 and/or 
DP178-like/DP107-like interactions utilizing the 
methods described herein can, in general, include, 
e.g., small molecules that are of a molecular weight 
up to about 1500 daltons. Test compounds, including 
small molecules, can include, but are not limited to, 
compounds obtained from any commercial source, 
including Aldrich (1001 West St. Paul Ave., Milwaukee, 
WI 53233), Sigma Chemical (P.O. Box 14508, St. Louis, 
MO 63178), Fluka Chemie AG (Industriestrasse 25, CH- 
9471 Buchs, Switzerland (Fluka Chemical Corp. 980 
South 2nd Street, Ronkonkoma, NY 11779)), Eastman 
Chemical Company, Fine Chemicals (P.O. Box 431,
Kingsport, TN 37662), Boehringer Mannheim GmbH
(Sandhofer Strasse 116, D-68298 Mannheim), Takasago (4
Volvo Drive, Rockleigh, NJ 07647), SST Corporation
(635 Brighton Road, Clifton, NJ 07012), Ferro (111
West Irene Road, Zachary, LA 70791), Riedel-deHaen
Aktiengesellschaft (P.O. Box D-30918, Seelze,
Germany), and PPG Industries Inc., Fine Chemicals (One PPG
Place, 34th Floor, Pittsburgh, PA 15272). Further, any
kind of natural products may be screened using the
methods of the invention, including microbial, fungal
or plant extracts.

Furthermore, diversity libraries of test
compounds, including small molecule test compounds,
may be commercially obtained from Specs and BioSpecs
B.V. (Rijswijk, The Netherlands), Chembridge
Corporation (San Diego, CA), Contract Service Company
(Dolgoprudny, Moscow Region, Russia), Comgenex USA
Inc. (Princeton, NJ), Maybridge Chemicals Ltd.
(Cornwall PL34 OHW, United Kingdom), and Asinex
(Moscow, Russia). Combinatorial libraries of test
compounds, including small molecule test compounds,
may be generated as disclosed in Eichler &
Houghten, 1995, Mol. Med. Today 1:174-180; Dolle,
1997, Mol. Divers. 2:223-236; Lam, 1997, Anticancer
Drug Des. 12:145-167. These references are
incorporated herein by reference in their entirety.
It is to be noted that such references also teach
additional screening methods which may be employed for
the further testing of compounds identified via the
methods of the invention and which can aid in
identifying and isolating compounds which can
represent leads and therapeutic compounds.

The compounds, antibodies, or other molecules
identified may be tested, for example, for an ability
to inhibit cell fusion or viral activity, utilizing,
for example, assays such as those described, above, in
Section 5.5.

Among the peptides which may be tested are
soluble peptides comprising DP107 and/or DP178
domains, and peptides comprising DP107 and/or DP178
domains having one or more mutations within one or
both of the domains, such as the M41-P peptide
described, below, in the Example presented in Section
8, which contains an isoleucine to proline mutation
within the DP178 sequence.

One embodiment of such screening methods is a
method for identifying a compound to be tested for
antiviral ability comprising:

(a) exposing at least one compound to a
peptide comprising a DP107 peptide for a time
sufficient to allow binding of the compound to the
DP107 peptide;

(b) removing non-bound compounds; and

(c) determining the presence of the
compound bound to the DP107 peptide,
thereby identifying an agent to be tested for
antiviral ability.

A second embodiment of such screening methods
is a method for identifying a compound to be tested
for antiviral ability comprising:

(a) exposing at least one compound to a
peptide comprising a DP178 peptide for a time
sufficient to allow binding of the compound to the
DP178 peptide;

(b) removing non-bound compounds; and

(c) determining the presence of the
compound bound to the DP178 peptide,
thereby identifying an agent to be tested for
antiviral ability.

One method utilizing these types of approaches
that may be pursued in the isolation of such
DP107-binding or DP178-binding compounds is an assay which
would include the attachment of either the DP107 or
the DP178 peptide to a solid matrix, such as, for
example, agarose or plastic beads, microtiter plate
wells, petri dishes, or membranes composed of, for
example, nylon or nitrocellulose. In such an assay
system, either the DP107 or DP178 protein may be
anchored onto a solid surface, and the compound, or
test substance, which is not anchored, is labeled,
either directly or indirectly (e.g., with a
radioactive label such as 125I, an absorption label
such as biotin, or a fluorescent label such as
fluorescein or rhodamine). In practice, microtiter
plates are conveniently utilized. The anchored
component may be immobilized by non-covalent or
covalent attachments. Non-covalent attachment may be
accomplished simply by coating the solid surface with
a solution of the protein and drying. Alternatively,
an immobilized antibody, preferably a monoclonal
antibody, specific for the protein may be used to
anchor the protein to the solid surface. The surfaces
may be prepared in advance and stored.

In order to conduct the assay, the labeled
compound is added to the coated surface containing the
anchored DP107 or DP178 peptide. After the reaction
is complete, unreacted components are removed (e.g.,
by washing) under conditions such that any complexes
formed will remain immobilized on the solid surface.
The detection of complexes anchored on the solid
surface can be accomplished in a number of ways.
Where the compound is pre-labeled, the detection of
label immobilized on the surface indicates that
complexes were formed. Where the compound is
not pre-labeled, an indirect label can be used to
detect complexes anchored on the surface; e.g., using
a labeled antibody specific for the compound (the
antibody, in turn, may be directly labeled or
indirectly labeled with a labeled anti-Ig antibody).

Alternatively, such an assay can be conducted in
a liquid phase, the reaction products separated from
unreacted components, and complexes detected; e.g.,
using an immobilized antibody specific for DP107 or
DP178, whichever is appropriate for the given assay,
or an antibody specific for the compound, i.e., the
test substance, in order to anchor any complexes
formed in solution, and a labeled antibody specific
for the other member of the complex to detect anchored
complexes.

By utilizing procedures such as this, large
numbers of types of molecules may be simultaneously
screened for DP107 or DP178-binding capability, and
thus potential antiviral activity.

Further, compounds may be screened for an ability
to inhibit the formation of or, alternatively, disrupt
DP107/DP178 complexes. Such compounds may then be
tested for antifusogenic, antiviral or intracellular
modulatory capability. For ease of description, DP107
and DP178 will be referred to as "binding partners."
Compounds that disrupt such interactions may exhibit
antiviral activity. Such compounds may include, but
are not limited to, molecules such as antibodies,
peptides, and the like described above.

The basic principle of the assay systems used to
identify compounds that interfere with the interaction
between the DP107 and DP178 peptides involves
preparing a reaction mixture containing peptides under
conditions and for a time sufficient to allow the two
peptides to interact and bind, thus forming a complex.
In order to test a compound for disruptive activity,
the reaction is conducted in the presence and absence
of the test compound, i.e., the test compound may be
initially included in the reaction mixture, or added
at a time subsequent to the addition of one of the
binding partners; controls are incubated without the
test compound or with a placebo. The formation of any
complexes between the binding partners is then
detected. The formation of a complex in the control
reaction, but not in the reaction mixture containing
the test compound, indicates that the compound
interferes with the interaction of the DP107 and DP178
peptides.

The assay for compounds that interfere with the
interaction of the binding partners can be conducted
in a heterogeneous or homogeneous format.
Heterogeneous assays involve anchoring one of the
binding partners onto a solid phase and detecting
complexes anchored on the solid phase at the end of
the reaction. In homogeneous assays, the entire
reaction is carried out in a liquid phase. In either
approach, the order of addition of reactants can be
varied to obtain different information about the
compounds being tested. For example, test compounds
that interfere with the interaction between the
binding partners, e.g., by competition, can be
identified by conducting the reaction in the presence
of the test substance; i.e., by adding the test
substance to the reaction mixture prior to or
simultaneously with the binding partners. On the
other hand, test compounds that disrupt preformed
complexes, e.g., compounds with higher binding
constants that displace one of the binding partners
from the complex, can be tested by adding the test
compound to the reaction mixture after complexes have
been formed. The various formats are described
briefly below.

In a heterogeneous assay system, one binding
partner, e.g., either the DP107 or DP178 peptide, is
anchored onto a solid surface, and its binding
partner, which is not anchored, is labeled, either
directly or indirectly (e.g., with a radioactive label
such as 125I, an absorption label such as biotin, or a
fluorescent label such as fluorescein or rhodamine).
In practice, microtiter plates are conveniently
utilized. The anchored species may be immobilized by
non-covalent or covalent attachments. Non-covalent
attachment may be accomplished simply by coating the
solid surface with a solution of the protein and
drying. Alternatively, an immobilized antibody
specific for the protein may be used to anchor the
protein to the solid surface. The surfaces may be
prepared in advance and stored.

In order to conduct the assay, the binding
partner of the immobilized species is added to the
coated surface with or without the test compound.
After the reaction is complete, unreacted components
are removed (e.g., by washing) and any complexes
formed will remain immobilized on the solid surface.
The detection of complexes anchored on the solid
surface can be accomplished in a number of ways.
Where the binding partner was pre-labeled, the
detection of label immobilized on the surface
indicates that complexes were formed. Where the
binding partner is not pre-labeled, an indirect label
can be used to detect complexes anchored on the
surface; e.g., using a labeled antibody specific for
the binding partner (the antibody, in turn, may be
directly labeled or indirectly labeled with a labeled
anti-Ig antibody). Depending upon the order of
addition of reaction components, test compounds which
inhibit complex formation or which disrupt preformed
complexes can be detected.

Alternatively, the reaction can be conducted in a
liquid phase in the presence or absence of the test
compound, the reaction products separated from
unreacted components, and complexes detected; e.g.,
using an immobilized antibody specific for one binding
partner to anchor any complexes formed in solution,
and a labeled antibody specific for the other binding
partner to detect anchored complexes. Again,
depending upon the order of addition of reactants to
the liquid phase, test compounds which inhibit complex
formation or which disrupt preformed complexes can be
identified.

In an alternate embodiment of the invention, a
homogeneous assay can be used. In this approach, a
preformed complex of the DP107 and DP178 peptides is
prepared in which one of the binding partners is
labeled, but the signal generated by the label is
quenched due to complex formation (see, e.g., U.S.
Patent No. 4,109,496 by Rubenstein which utilizes this
approach for immunoassays). The addition of a test
substance that competes with and displaces one of the
binding partners from the preformed complex will
result in the generation of a signal above background.
In this way, test substances which disrupt
DP107/DP178 protein-protein interaction can be identified.

In still another embodiment of the invention,
fluorescence polarization may be used in a homogeneous
assay. In this approach, complex formation is
detected by measuring the polarization of a
fluorescently labeled peptide (e.g., with fluorescein
or rhodamine) in a sample. Binding of the peptide to
its complementary HR1 or HR2 binding domain in a
larger molecular weight peptide or protein, such as in
a maltose binding fusion protein described herein,
alters the correlation time of the fluorescent moiety
and thereby decreases the fluorescence polarization of
the labeled peptide.
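
For reference, polarization is conventionally computed from the emission intensities measured parallel and perpendicular to the excitation polarization. The following is a minimal sketch using that standard definition; it is not taken from the Examples. The function names, the instrument correction factor (`g_factor`), and the intensity values are assumptions made for illustration.

```python
def polarization(i_parallel, i_perpendicular, g_factor=1.0):
    """Fluorescence polarization P = (I_par - G*I_perp) / (I_par + G*I_perp)."""
    denom = i_parallel + g_factor * i_perpendicular
    if denom <= 0:
        raise ValueError("intensities must sum to a positive value")
    return (i_parallel - g_factor * i_perpendicular) / denom


def polarization_shift_mp(reference, sample, g_factor=1.0):
    """Change in polarization between two samples, in millipolarization units.

    Each argument is an (I_parallel, I_perpendicular) pair; the result is
    the polarization of `sample` minus that of `reference`.
    """
    p_ref = polarization(reference[0], reference[1], g_factor)
    p_sample = polarization(sample[0], sample[1], g_factor)
    return 1000.0 * (p_sample - p_ref)


# Hypothetical intensity readings (arbitrary units) for two wells.
shift = polarization_shift_mp(reference=(110.0, 90.0), sample=(130.0, 70.0))
```

In a screening setting, the shift measured with and without a test compound would be compared; the sign and size of the shift depend on the particular labeled peptide and binding partner.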

In an alternative screening assay, test compounds
may be assayed for their ability to disrupt a
DP178/DP107 interaction, as measured immunometrically
using an antibody specifically reactive to a
DP107/DP178 complex (i.e., an antibody that recognizes
neither DP107 nor DP178 individually). Such an assay
acts as a competition assay, and is based on
techniques well known to those of skill in the art.

The above competition assay may be described, by
way of example, and not by way of limitation, by using
the DP178 and M41Δ178 peptides and by assaying test
compounds for the disruption of the complexes formed
by these two peptides by immunometrically visualizing
DP178/M41Δ178 complexes via the human recombinant Fab,
Fab-d, as described, below, in the Example presented
in Section 8. M41Δ178 is a maltose binding fusion
protein containing a gp41 region having its DP178
domain deleted, and is described, below, in the
Example presented in Section 8.

Utilizing such an assay, M41Δ178 may be
immobilized onto solid supports such as microtiter
wells. A series of dilutions of a test compound may
then be added to each M41Δ178-containing well in the
presence of a constant concentration of DP178
peptide. After incubation, at, for example, room
temperature for one hour, unbound DP178 and test
compound are removed from the wells and wells are then
incubated with the DP178/M41Δ178-specific Fab-d
antibody. After incubation and washing, unbound Fab-d
is removed from the plates and bound Fab-d is
quantitated. A no-inhibitor control should also be
conducted. Test compounds showing an ability to
disrupt DP178/M41Δ178 complex formation are identified
by their concentration-dependent decrease in the level
of Fab-d binding.
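
By way of example, and not by way of limitation, the concentration-dependent decrease in Fab-d binding could be reduced to a percent inhibition relative to the no-inhibitor control and a half-maximal concentration. The sketch below uses log-linear interpolation between the two bracketing dilutions; the function names and the plate readings are hypothetical and are not data from this disclosure.

```python
import math
from typing import Optional, Sequence


def percent_inhibition(signal: float, control: float, background: float = 0.0) -> float:
    """Percent loss of Fab-d signal relative to the no-inhibitor control."""
    return 100.0 * (1.0 - (signal - background) / (control - background))


def ic50_by_interpolation(concs: Sequence[float],
                          inhibitions: Sequence[float]) -> Optional[float]:
    """Interpolate on log10(concentration) between the points bracketing 50%."""
    points = sorted(zip(concs, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= 50.0 <= i_hi:
            if i_hi == i_lo:
                return c_lo
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_c
    return None  # 50% inhibition was not reached within the dilution series


# Hypothetical readings: bound Fab-d signal (arbitrary units) per dilution.
control_signal = 1.0
test_concs = [0.1, 1.0, 10.0, 100.0]   # arbitrary concentration units
test_signals = [0.9, 0.7, 0.3, 0.1]

inhibitions = [percent_inhibition(s, control_signal) for s in test_signals]
ic50 = ic50_by_interpolation(test_concs, inhibitions)
print(f"estimated IC50: {ic50:.2f}")
```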

A variation of such an assay may be utilized to
perform a rapid, high-throughput binding assay which
is capable of directly measuring DP178 binding to
M41Δ178 for the determination of binding constants of
the ligand or inhibitory constants for competitors of
DP178 binding.

Such an assay takes advantage of accepted
radioligand and receptor binding principles. (See,
for example, Yamamura, H.I. et al., 1985,
"Neurotransmitter Receptor Binding", 2nd ed., Raven
Press, NY.) As above, M41Δ178 is immobilized onto a
solid support such as a microtiter well. DP178
binding to M41Δ178 is then quantitated by measuring
the fraction of DP178 that is bound as 125I-DP178 and
calculating the total amount bound using a value for
specific activity (dpm/µg peptide) determined for each
labeled DP178 preparation. Specific binding to
M41Δ178 is defined as the difference between the
binding of the labeled DP178 preparation in the
microtiter wells (totals) and the binding in identical
wells containing, in addition, excess unlabeled DP178
(nonspecifics).
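
The arithmetic described above can be sketched as follows. This is a minimal illustration only; the counts and the specific activity are hypothetical values, not measurements from this disclosure.

```python
def bound_peptide_ug(dpm: float, specific_activity_dpm_per_ug: float) -> float:
    """Convert measured radioactivity to mass of bound peptide."""
    return dpm / specific_activity_dpm_per_ug


def specific_binding_ug(total_dpm: float,
                        nonspecific_dpm: float,
                        specific_activity_dpm_per_ug: float) -> float:
    """Specific binding = totals minus nonspecifics, expressed as peptide mass."""
    return bound_peptide_ug(total_dpm - nonspecific_dpm,
                            specific_activity_dpm_per_ug)


# Hypothetical well readings.
TOTAL_DPM = 12_000.0        # labeled DP178 only
NONSPECIFIC_DPM = 2_000.0   # labeled DP178 plus excess unlabeled DP178
SPECIFIC_ACTIVITY = 1.0e6   # dpm per microgram, determined per preparation

specific_ug = specific_binding_ug(TOTAL_DPM, NONSPECIFIC_DPM, SPECIFIC_ACTIVITY)
print(f"specifically bound DP178: {specific_ug:.4f} ug")
```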

Because the binding affinity for native DP178 and
DP107 is very high (including native DP178-like and
DP107-like peptides from other species; e.g., 10 nM
for DP178 in HIV-1, and 2 nM for T112 in RSV), test
compounds must exhibit high binding affinity to
interfere with or disrupt the DP178/DP107 binding
interaction. Accordingly, in another non-limiting
example of the above-described competition assays,
such assays can be performed using "modified" DP107
and/or DP178 peptides (e.g., DP107 and/or DP178
analogs) which have reduced binding affinities
relative to the unmodified "parent peptides". The
use of such modified DP107 and DP178 peptides greatly
increases the sensitivity of the competition assays of
the invention by identifying more compounds with
inhibitory potential. The binding affinities of
compounds identified in the assays can then be
optimized, e.g., using standard medicinal chemistry
techniques, to generate compounds that are more
powerful inhibitors of DP107/DP178 complex formation
and are therefore useful, e.g., as antiviral reagents.
Alternatively, compounds identified in the competition
assays using DP107 and/or DP178 analogs with reduced
binding affinities may, themselves, be useful, e.g.,
as antiviral reagents.

The term "reduced affinity," as used herein,
refers to a DP107, DP178, DP107-like or DP178-like
peptide that interacts with and forms a DP107/DP178
peptide pair, an HR1/DP178 pair or an HR2/DP107 pair
under competition assay conditions, but interacts with
its "partner" to form such a pair with a lower
affinity than would a DP107 or DP178 "parent" peptide
from which the reduced affinity peptide is derived.

Generally, the binding affinity of a peptide can
be expressed as a B50 value, i.e., the concentration of
peptide necessary for 50% of the peptide molecules to
bind to their target under a given set of conditions.
Preferably, the B50 value of a reduced affinity peptide
will be at least twice, and more preferably at least
five times, at least 10 times, at least 20 times, or
at least 100 times the B50 value of the unmodified
peptide from which it was derived.
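
As a non-limiting sketch, the fold-change criterion above can be checked directly from two B50 values. The function names are hypothetical; the example values (1 nM and 5 nM) are those quoted in this section for the full-length and truncated RSV peptide T112.

```python
def b50_fold_change(b50_modified_nM: float, b50_parent_nM: float) -> float:
    """Ratio of the modified peptide's B50 to that of its parent peptide."""
    return b50_modified_nM / b50_parent_nM


def is_reduced_affinity(b50_modified_nM: float,
                        b50_parent_nM: float,
                        min_fold: float = 2.0) -> bool:
    """True if the modified peptide's B50 is at least min_fold times the parent's."""
    return b50_fold_change(b50_modified_nM, b50_parent_nM) >= min_fold


fold = b50_fold_change(5.0, 1.0)
for threshold in (2.0, 5.0, 10.0, 20.0, 100.0):
    print(threshold, is_reduced_affinity(5.0, 1.0, threshold))
```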

Modified DP107 and DP178 peptides that have
reduced binding affinities may be generated according
to any number of techniques that will be readily
apparent to those skilled in the art. For example, in
one embodiment modified DP107 and DP178 peptides with
reduced binding affinities may be generated by
generating truncated DP107 and DP178 peptides,
respectively. Such peptides may be routinely
synthesized and tested, e.g., by the above described
screening assays, to determine their binding
affinities to their target. For example, as described
in the example presented below in Section 30, reducing
the length of the native RSV DP178-like peptide T112
from 35 to 28 amino acid residues resulted in a
five-fold drop in binding affinity (from 1 nM to 5 nM).
Generally, such truncation can be of 1, 2, 3, 4, 5, 6,
7, 8, 9 or 10 amino acid residues.
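
By way of illustration, a set of candidate truncated peptides could be enumerated as follows. The sketch removes 1 to 10 residues from either terminus only; the sequence shown is a hypothetical placeholder, not a DP107 or DP178 sequence, and the naming scheme is an assumption.

```python
from typing import List, Tuple


def truncations(sequence: str, max_removed: int = 10) -> List[Tuple[str, str]]:
    """List N- and C-terminal truncations of 1..max_removed residues."""
    variants = []
    for n in range(1, max_removed + 1):
        if n >= len(sequence):
            break  # never truncate the peptide to nothing
        variants.append((f"N-{n}", sequence[n:]))   # n residues off the N terminus
        variants.append((f"C-{n}", sequence[:-n]))  # n residues off the C terminus
    return variants


PLACEHOLDER = "ACDEFGHIKLMNPQRSTVWY"  # hypothetical 20-residue peptide

candidates = truncations(PLACEHOLDER)
for name, seq in candidates[:4]:
    print(name, seq)
```

Each candidate would then be synthesized and its binding affinity measured in the screening assays described above.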

Alternatively, modified DP107 and DP178 peptides
with reduced binding affinity may be identified and
generated by identifying and substituting, inserting
or deleting amino acid residues. For example, in one
embodiment, which is also demonstrated in the example
presented below in Section 30, modified DP107 and/or
DP178 peptides may be routinely synthesized and
assayed for reduced binding affinity by systematically
replacing one or more amino acid residues of the
native DP107 or DP178 peptide with other amino acid
residues and testing the binding affinity of the
resulting peptide by techniques such as those
described herein. Preferably, the substituted amino
acid residues are neutral amino acid residues
exhibiting relatively small side chains, such as
alanine or glycine.
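
The systematic single-residue replacement described above can be sketched as a simple scan. The sequence below is a hypothetical placeholder rather than a DP107 or DP178 sequence, and the variant naming (original residue, position, replacement) is an assumed convention.

```python
from typing import List, Tuple


def substitution_scan(sequence: str, replacement: str = "A") -> List[Tuple[str, str]]:
    """Replace each residue in turn with `replacement`, one variant per position."""
    variants = []
    for index, residue in enumerate(sequence):
        if residue == replacement:
            continue  # position already carries the replacement residue
        name = f"{residue}{index + 1}{replacement}"
        variants.append((name, sequence[:index] + replacement + sequence[index + 1:]))
    return variants


PLACEHOLDER = "MKTAYLQE"  # hypothetical 8-residue peptide

alanine_variants = substitution_scan(PLACEHOLDER, "A")
glycine_variants = substitution_scan(PLACEHOLDER, "G")
for name, seq in alanine_variants:
    print(name, seq)
```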

Such substitutions can identify "key" amino acid
residues and can be used in the competition assays of
the invention. Alternatively, upon identification of
key residues by such systematic substitutions, the key
residues can be changed to other residues and the
resulting, modified peptides can be tested for binding
affinity.

Modified DP107 and/or DP178 peptides that have
reduced binding affinities may still further be
identified using principles of protein chemistry and
design that are well known to those of skill in the
art. Specifically, such principles may be used to
identify those amino acid residues of a native DP107
or DP178 sequence that affect, e.g., solubility,
binding affinity, or stability of the peptide. Thus,
for example, using known principles of amino acid
chemistry and protein design one skilled in the art
could identify amino acid residues in a native DP107
or DP178 peptide that affect the structure of the
peptide.

5.7 PHARMACEUTICAL FORMULATIONS, DOSAGES 
AND MODES OF ADMINISTRATION 

The peptides of the invention may be administered
using techniques well known to those in the art.
Preferably, agents are formulated and administered
systemically. Techniques for formulation and
administration may be found in "Remington's
Pharmaceutical Sciences", 18th ed., 1990, Mack
Publishing Co., Easton, PA. Suitable routes may
include oral, rectal, transmucosal, or intestinal
administration; parenteral delivery, including
intramuscular, subcutaneous, intramedullary
injections, as well as intrathecal, direct
intraventricular, intravenous, intraperitoneal,
intranasal, or intraocular injections, just to name a
few. For injection, the agents of the invention may
be formulated in aqueous solutions, preferably in
physiologically compatible buffers such as Hanks'
solution, Ringer's solution, or physiological saline
buffer. For such transmucosal administration,
penetrants appropriate to the barrier to be permeated
are used in the formulation. Such penetrants are
generally known in the art.

In instances wherein intracellular administration
of the peptides of the invention or other inhibitory
agents is preferred, techniques well known to those of
ordinary skill in the art may be utilized. For
example, such agents may be encapsulated into
liposomes, then administered as described above.
Liposomes are spherical lipid bilayers with aqueous
interiors. All molecules present in an aqueous
solution at the time of liposome formation are
incorporated into the aqueous interior. The liposomal
contents are both protected from the external
microenvironment and, because liposomes fuse with cell
membranes, are effectively delivered into the cell
cytoplasm. Additionally, due to their hydrophobicity,
when small molecules are to be administered, direct
intracellular administration may be achieved.

Nucleotide sequences encoding the peptides of the
invention which are to be intracellularly administered
may be expressed in cells of interest, using
techniques well known to those of skill in the art.
For example, expression vectors derived from viruses
such as retroviruses, vaccinia viruses, adeno-
associated viruses, herpes viruses, or bovine
papilloma viruses, may be used for delivery and
expression of such nucleotide sequences into the
targeted cell population. Methods for the
construction of such vectors and expression constructs
are well known. See, for example, Sambrook et al.,
1989, Molecular Cloning, A Laboratory Manual, Cold
Spring Harbor Press, Cold Spring Harbor NY, and
Ausubel et al., 1989, Current Protocols in Molecular
Biology, Greene Publishing Associates and Wiley
Interscience, NY.

With respect to HIV, peptides of the invention,
particularly DP107 and DP178, may be used as
therapeutics in the treatment of AIDS. In addition,
the peptides may be used as prophylactic measures in
previously uninfected individuals after acute exposure
to an HIV virus. Examples of such prophylactic use of
the peptides may include, but are not limited to,
prevention of virus transmission from mother to infant
and other settings where the likelihood of HIV
transmission exists, such as, for example, accidents
in health care settings wherein workers are exposed to
HIV-containing blood products. The successful use of
such treatments does not rely upon the generation of a
host immune response directed against such peptides.

Effective dosages of the peptides of the
invention to be administered may be determined through
procedures well known to those in the art which
address such parameters as biological half-life,
bioavailability, and toxicity. Given the data
presented below in Section 6, DP178, for example, may
prove efficacious in vivo at doses required to achieve
circulating levels of about 1 to about 10 ng per ml of
peptide.
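
For orientation, the circulating levels quoted above can be restated in molar terms. The sketch below treats the calculated mass of DP178 reported in Section 6.1.1 (4491.94) as its molar mass in g/mol, which is an assumption made for illustration; the function name is hypothetical.

```python
DP178_MOLAR_MASS = 4491.94  # g/mol; calculated mass from Section 6.1.1 (assumed molar mass)


def ng_per_ml_to_nanomolar(conc_ng_per_ml: float, molar_mass: float) -> float:
    """1 ng/ml equals 1e-6 g/L; divide by g/mol and scale mol/L to nmol/L."""
    return conc_ng_per_ml * 1000.0 / molar_mass


low_nM = ng_per_ml_to_nanomolar(1.0, DP178_MOLAR_MASS)
high_nM = ng_per_ml_to_nanomolar(10.0, DP178_MOLAR_MASS)
print(f"1 ng/ml  is about {low_nM:.3f} nM")
print(f"10 ng/ml is about {high_nM:.3f} nM")
```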

A therapeutically effective dose refers to that
amount of the compound sufficient to result in
amelioration of symptoms or a prolongation of survival
in a patient. Toxicity and therapeutic efficacy of
such compounds can be determined by standard
pharmaceutical procedures in cell cultures or
experimental animals, e.g., for determining the LD50
(the dose lethal to 50% of the population) and the
ED50 (the dose therapeutically effective in 50% of the
population). The dose ratio between toxic and
therapeutic effects is the therapeutic index and it
can be expressed as the ratio LD50/ED50. Compounds
which exhibit large therapeutic indices are preferred.
The data obtained from these cell culture assays and
animal studies can be used in formulating a range of
dosage for use in humans. The dosage of such
compounds lies preferably within a range of
circulating concentrations that include the ED50 with
little or no toxicity. The dosage may vary within
this range depending upon the dosage form employed and
the route of administration utilized. For any
compound used in the method of the invention, the
therapeutically effective dose can be estimated
initially from cell culture assays. A dose may be
formulated in animal models to achieve a circulating
plasma concentration range that includes the IC50
(e.g., the concentration of the test compound which
achieves a half-maximal inhibition of the fusogenic
event, such as a half-maximal inhibition of viral
infection relative to the amount of the event in the
absence of the test compound) as determined in cell
culture. Such information can be used to more
accurately determine useful doses in humans. Levels
in plasma may be measured, for example, by high
performance liquid chromatography (HPLC).
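
The therapeutic index defined above is a simple ratio, sketched here for ranking candidate compounds. The compound names and dose values are hypothetical and are given in arbitrary but consistent units.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Therapeutic index expressed as the ratio LD50/ED50."""
    if ed50 <= 0:
        raise ValueError("ED50 must be positive")
    return ld50 / ed50


# Hypothetical (LD50, ED50) pairs for two candidate compounds.
candidates = {
    "compound_A": (500.0, 5.0),
    "compound_B": (200.0, 20.0),
}

# Compounds with larger indices are preferred, so rank in descending order.
ranked = sorted(candidates,
                key=lambda name: therapeutic_index(*candidates[name]),
                reverse=True)
for name in ranked:
    print(name, therapeutic_index(*candidates[name]))
```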

The peptides of the invention may, further, serve 
the role of a prophylactic vaccine, wherein the host 
raises antibodies against the peptides of the 
invention, which then serve to neutralize HIV viruses 
by, for example, inhibiting further HIV infection. 

Administration of the peptides of the invention
as a prophylactic vaccine, therefore, would comprise
administering to a host a concentration of peptides
effective in raising an immune response which is
sufficient to neutralize HIV, by, for example,
inhibiting the ability of HIV to infect cells. The exact
concentration will depend upon the specific peptide to
be administered, but may be determined by using
standard techniques for assaying the development of an
immune response which are well known to those of
ordinary skill in the art. The peptides to be used as
vaccines are usually administered intramuscularly.

The peptides may be formulated with a suitable
adjuvant in order to enhance the immunological
response. Such adjuvants may include, but are not
limited to, mineral gels such as aluminum hydroxide;
surface active substances such as lysolecithin,
pluronic polyols, polyanions; other peptides; oil
emulsions; and potentially useful human adjuvants such
as BCG and Corynebacterium parvum. Many methods may
be used to introduce the vaccine formulations
described here. These methods include but are not
limited to oral, intradermal, intramuscular,
intraperitoneal, intravenous, subcutaneous, and
intranasal routes.

Alternatively, an effective concentration of
polyclonal or monoclonal antibodies raised against the
peptides of the invention may be administered to a
host so that no uninfected cells become infected by
HIV. The exact concentration of such antibodies will
vary according to each specific antibody preparation,
but may be determined using standard techniques well
known to those of ordinary skill in the art.
Administration of the antibodies may be accomplished
using a variety of techniques, including, but not
limited to those described in this section.

For all such treatments described above, the
exact formulation, route of administration and dosage
can be chosen by the individual physician in view of
the patient's condition. (See, e.g., Fingl et al.,
1975, in "The Pharmacological Basis of Therapeutics",
Ch. 1 p. 1).

It should be noted that the attending physician
would know how to and when to terminate, interrupt, or
adjust administration due to toxicity, or to organ
dysfunctions. Conversely, the attending physician
would also know to adjust treatment to higher levels
if the clinical response were not adequate (precluding
toxicity). The magnitude of an administered dose in
the management of the oncogenic disorder of interest
will vary with the severity of the condition to be
treated and the route of administration. The dose, and
perhaps dose frequency, will also vary according to
the age, body weight, and response of the individual
patient. A program comparable to that discussed above
may be used in veterinary medicine.

Use of pharmaceutically acceptable carriers to
formulate the compounds herein disclosed for the
practice of the invention into dosages suitable for
systemic administration is within the scope of the
invention. With proper choice of carrier and suitable
manufacturing practice, the compositions of the
present invention, in particular, those formulated as
solutions, may be administered parenterally, such as
by intravenous injection. The compounds can be
formulated readily using pharmaceutically acceptable
carriers well known in the art into dosages suitable
for oral administration. Such carriers enable the
compounds of the invention to be formulated as
tablets, pills, capsules, liquids, gels, syrups,
slurries, suspensions and the like, for oral ingestion
by a patient to be treated.

Pharmaceutical compositions suitable for use in
the present invention include compositions wherein the
active ingredients are contained in an effective
amount to achieve their intended purpose. Determination
of the effective amounts is well within the capability
of those skilled in the art, especially in light of
the detailed disclosure provided herein.

In addition to the active ingredients, these
pharmaceutical compositions may contain suitable
pharmaceutically acceptable carriers comprising
excipients and auxiliaries which facilitate processing
of the active compounds into preparations which can be
used pharmaceutically. The preparations formulated
for oral administration may be in the form of tablets,
dragees, capsules, or solutions.

The pharmaceutical compositions of the present
invention may be manufactured in a manner that is
itself known, e.g., by means of conventional mixing,
dissolving, granulating, dragee-making, levigating,
emulsifying, encapsulating, entrapping or lyophilizing
processes.

Pharmaceutical formulations for parenteral
administration include aqueous solutions of the active
compounds in water-soluble form. Additionally,
suspensions of the active compounds may be prepared as
appropriate oily injection suspensions. Suitable
lipophilic solvents or vehicles include fatty oils
such as sesame oil, or synthetic fatty acid esters,
such as ethyl oleate or triglycerides, or liposomes.
Aqueous injection suspensions may contain substances
which increase the viscosity of the suspension, such
as sodium carboxymethyl cellulose, sorbitol, or
dextran. Optionally, the suspension may also contain
suitable stabilizers or agents which increase the
solubility of the compounds to allow for the
preparation of highly concentrated solutions.

Pharmaceutical preparations for oral use can be
obtained by combining the active compounds with solid
excipient, optionally grinding a resulting mixture,
and processing the mixture of granules, after adding
suitable auxiliaries, if desired, to obtain tablets or
dragee cores. Suitable excipients are, in particular,
fillers such as sugars, including lactose, sucrose,
mannitol, or sorbitol; cellulose preparations such as,
for example, maize starch, wheat starch, rice starch,
potato starch, gelatin, gum tragacanth, methyl
cellulose, hydroxypropylmethyl-cellulose, sodium
carboxymethylcellulose, and/or polyvinylpyrrolidone
(PVP). If desired, disintegrating agents may be
added, such as the cross-linked polyvinyl pyrrolidone,
agar, or alginic acid or a salt thereof such as sodium
alginate.

Dragee cores are provided with suitable coatings.
For this purpose, concentrated sugar solutions may be
used, which may optionally contain gum arabic, talc,
polyvinyl pyrrolidone, carbopol gel, polyethylene
glycol, and/or titanium dioxide, lacquer solutions,
and suitable organic solvents or solvent mixtures.
Dyestuffs or pigments may be added to the tablets or
dragee coatings for identification or to characterize
different combinations of active compound doses.

Pharmaceutical preparations which can be used
orally include push-fit capsules made of gelatin, as
well as soft, sealed capsules made of gelatin and a
plasticizer, such as glycerol or sorbitol. The
push-fit capsules can contain the active ingredients
in admixture with filler such as lactose, binders such
as starches, and/or lubricants such as talc or
magnesium stearate and, optionally, stabilizers. In
soft capsules, the active compounds may be dissolved
or suspended in suitable liquids, such as fatty oils,
liquid paraffin, or liquid polyethylene glycols. In
addition, stabilizers may be added.

6. EXAMPLE: DP178 (SEQ ID:1) IS A POTENT
INHIBITOR OF HIV-1 INFECTION

In this example, DP178 (SEQ ID:1) is shown to be
a potent inhibitor of HIV-1 mediated CD-4+ cell-cell
fusion and infection by cell free virus. In the
fusion assay, this peptide completely blocks virus
induced syncytia formation at concentrations of from
1-10 ng/ml. In the infectivity assay the inhibitory
concentration is somewhat higher, blocking infection
at 90 ng/ml. It is further shown that the antiviral
activity of DP178 (SEQ ID:1) is highly specific for
HIV-1. Additionally, a synthetic peptide, DP-185
(SEQ ID:3), representing an HIV-1-derived DP178
homolog, is also found to block HIV-1-mediated
syncytia formation.

6.1. MATERIALS AND METHODS



6.1.1. PEPTIDE SYNTHESIS 
Peptides were synthesized using Fast Moc
chemistry on an Applied Biosystems Model 431A peptide
synthesizer. Generally, unless otherwise noted, the
peptides contained amidated carboxy termini and
acetylated amino termini. Amidated peptides were
prepared using Rink resin (Advanced Chemtech) while
peptides containing free carboxy termini were
synthesized on Wang (p-alkoxy-benzyl-alcohol) resin
(Bachem). First residues were double coupled to the
appropriate resin and subsequent residues were single
coupled. Each coupling step was followed by acetic
anhydride capping. Peptides were cleaved from the
resin by treatment with trifluoroacetic acid (TFA)
(10ml), H2O (0.5ml), thioanisole (0.5ml), ethanedithiol
(0.25ml), and crystalline phenol (0.75g). Purification
was carried out by reverse phase HPLC.
Approximately 50mg samples of crude peptide were
chromatographed on a Waters Delta Pak C18 column (19mm
x 30cm, 15µ spherical) with a linear gradient;
H2O/acetonitrile 0.1% TFA. Lyophilized peptides were
stored desiccated and peptide solutions were made in
water at about 1mg/ml. Electrospray mass spectrometry
yielded the following results: DP178 (SEQ
ID:1): 4491.87 (calculated 4491.94); DP-180 (SEQ
ID:2): 4491.45 (calculated 4491.94); DP-185 (SEQ
ID:3): not done (calculated 4546.97).
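
As an illustrative check only, the agreement between the observed and calculated masses reported above can be expressed as an absolute difference and in parts per million. The function name is hypothetical; the mass values are those stated in this section.

```python
from typing import Dict, Tuple


def mass_error(observed: float, calculated: float) -> Tuple[float, float]:
    """Return (absolute difference, difference in ppm of the calculated mass)."""
    delta = observed - calculated
    return delta, delta / calculated * 1.0e6


# (observed, calculated) masses as reported for each peptide.
reported: Dict[str, Tuple[float, float]] = {
    "DP178": (4491.87, 4491.94),
    "DP-180": (4491.45, 4491.94),
}

errors = {name: mass_error(obs, calc) for name, (obs, calc) in reported.items()}
for name, (delta, ppm) in errors.items():
    print(f"{name}: delta = {delta:+.2f}, {ppm:+.1f} ppm")
```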



6.1.2. VIRUS
The HIV-1LAI virus was obtained from R. Gallo
(Popovic, M. et al., 1984, Science 224:497-508) and
propagated in CEM cells cultured in RPMI 1640
containing 10% fetal calf serum. Supernatant from the
infected CEM cells was passed through a 0.2 µm filter
and the infectious titer estimated in a
microinfectivity assay using the AA5 cell line to
support virus replication. For this purpose, 25 µl of
serial diluted virus was added to 75 µl AA5 cells at a
concentration of 2 x 10^5/ml in a 96-well microtitre
plate. Each virus dilution was tested in triplicate.
Cells were cultured for eight days by addition of
fresh medium every other day. On day 8 post
infection, supernatant samples were tested for virus
replication as evidenced by reverse transcriptase
activity released to the supernatant. The TCID50 was
calculated according to the Reed and Muench formula
(Reed, L.J. et al., 1938, Am. J. Hyg. 27:493-497).
The titer of the HIV-1LAI and HIV-1MN stocks used for
these studies, as measured on the AA5 cell line, was
approximately 1.4 x 10^6 and 3.8 x 10^4 TCID50/ml,
respectively.
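
By way of illustration, the Reed and Muench endpoint calculation referred to above can be sketched as follows. The well counts are hypothetical (triplicate wells over a ten-fold dilution series) and are not data from this disclosure; the 25 µl inoculum volume is taken from the text, and the function name is an assumption.

```python
from typing import Sequence


def reed_muench_log10_endpoint(log10_dilutions: Sequence[float],
                               positives: Sequence[int],
                               wells_per_dilution: int) -> float:
    """50% endpoint (log10 dilution) by the Reed and Muench method.

    Dilutions are ordered from most concentrated to most dilute,
    e.g. [-1, -2, -3, -4].
    """
    n = len(positives)
    negatives = [wells_per_dilution - p for p in positives]
    # Cumulative positives accumulate from the most dilute wells upward;
    # cumulative negatives accumulate from the most concentrated downward.
    cum_pos = [sum(positives[i:]) for i in range(n)]
    cum_neg = [sum(negatives[:i + 1]) for i in range(n)]
    pct = [100.0 * cp / (cp + cn) for cp, cn in zip(cum_pos, cum_neg)]
    for i in range(n - 1):
        if pct[i] >= 50.0 > pct[i + 1]:
            proportionate_distance = (pct[i] - 50.0) / (pct[i] - pct[i + 1])
            step = log10_dilutions[i + 1] - log10_dilutions[i]
            return log10_dilutions[i] + proportionate_distance * step
    raise ValueError("50% endpoint is not bracketed by the dilution series")


# Hypothetical triplicate results: wells positive for RT activity per dilution.
endpoint = reed_muench_log10_endpoint([-1, -2, -3, -4], [3, 3, 1, 0], 3)

INOCULUM_ML = 0.025  # 25 microliters of diluted virus per well
tcid50_per_ml = 10.0 ** (-endpoint) / INOCULUM_ML
print(f"log10 endpoint dilution: {endpoint:.2f}")
print(f"titer: {tcid50_per_ml:.0f} TCID50/ml")
```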

6.1.3. CELL FUSION ASSAY 
Approximately 7 x 10^4 Molt cells were incubated
with 1 x 10^4 CEM cells chronically infected with the
HIV-1LAI virus in 96-well plates (one-half area cluster
plates; Costar, Cambridge, MA) in a final volume of
100 µl culture medium as previously described
(Matthews, T.J. et al., 1987, Proc. Natl. Acad. Sci.
USA 84:5424-5428). Peptide inhibitors were added in
a volume of 10 µl and the cell mixtures were incubated
for 24 hr. at 37°C. At that time, multinucleated
giant cells were estimated by microscopic examination
at a 40x magnification which allowed visualization of
the entire well in a single field.

6.1.4. CELL FREE VIRUS INFECTION ASSAY 
Synthetic peptides were incubated at 37°C with
either 247 TCID50 (for experiment depicted in FIG. 2),
or 62 TCID50 (for experiment depicted in FIG. 3) units
of HIV-1LAI virus or 25 TCID50 units of HIV-2NIHZ and CEM
CD4+ cells at peptide concentrations of 0, 0.04, 0.4,
4.0, and 40 µg/ml for 7 days. The resulting reverse
transcriptase (RT) activity in counts per minute was
determined using the assay described, below, in
Section 6.1.5. See, Reed, L.J. et al., 1938, Am. J.
Hyg. 27:493-497 for an explanation of TCID50
calculations.

6.1.5. REVERSE TRANSCRIPTASE ASSAY 
The micro-reverse transcriptase (RT) assay was
adapted from Goff et al. (Goff, S. et al., 1981, J.
Virol. 38:239-248) and Willey et al. (Willey, R. et
al., 1988, J. Virol. 62:139-147). Supernatants from
virus/cell cultures were adjusted to 1% Triton-X100. A
10 µl sample of supernatant was added to 50 µl of RT
cocktail in a 96-well U-bottom microtitre plate and
the samples incubated at 37°C for 90 min. The RT
cocktail contained 75mM KCl, 2mM dithiothreitol, 5mM
MgCl2, 5 µg/ml poly A (Pharmacia, cat. No. 27-4110-01),
0.25 units/ml oligo dT (Pharmacia, cat. No. 27-7858-
01), 0.05% NP40, 50mM Tris-HCl, pH 7.8, 0.5 µM non-
radioactive dTTP, and 10 µCi/ml 32P-dTTP (Amersham, cat.
No. PB.10167).

After the incubation period, 40 µl of reaction
mixture was applied to a Schleicher and Schuell (S+S)
NA45 membrane (or DE81 paper) saturated in 2 x SSC
buffer (0.3M NaCl and 0.003M sodium citrate) held in a
S+S Minifold over one sheet of GB003 (S+S) filter
paper, with partial vacuum applied. Each well of the
minifold was washed four times with 200 µl 2xSSC, under
full vacuum. The membrane was removed from the
minifold and washed 2 more times in a pyrex dish with
an excess of 2xSSC. Finally, the membrane was drained
on absorbent paper, placed on Whatman #3 paper,
covered with Saran wrap, and exposed to film overnight
at -70°C.

6.2. RESULTS 

6.2.1. PEPTIDE INHIBITION OF INFECTED CELL-
INDUCED SYNCYTIA FORMATION

The initial screen for antiviral activity assayed
peptides' ability to block syncytium formation induced
by overnight co-cultivation of uninfected Molt4 cells
with chronically HIV-1 infected CEM cells. The
results of several such experiments are presented
herein. In the first of these experiments, serial
DP178 (SEQ ID:1) peptide concentrations between
10 µg/ml and 12.5 ng/ml were tested for blockade of the
cell fusion process. For these experiments, CEM cells
chronically infected with either HIV-1LAI, HIV-1MN,
HIV-1RF, or HIV-1SF2 virus were cocultivated overnight
with uninfected Molt4 cells. The results (FIG. 4) show
that DP178 (SEQ ID:1) afforded complete protection
against each of the HIV-1 isolates down to the lowest
concentration of DP178 (SEQ ID:1) used. For HIV-1LAI
inhibition, the lowest concentration tested was
12.5 ng/ml; for all other HIV-1 viruses, the lowest
concentration of DP178 (SEQ ID:1) used in this study
was 100 ng/ml. A second peptide, DP-180 (SEQ ID:2),
containing the same amino acid residues as DP178 (SEQ
ID:1) but arranged in a random order, exhibited no
evidence of anti-fusogenic activity even at the high
concentration of 40 µg/ml (FIG. 4). These observations
indicate that the inhibitory effect of DP178 (SEQ
ID:1) is primary sequence-specific and not related to
non-specific peptide/protein interactions. The actual
endpoint (i.e., the lowest effective inhibitory
concentration) of DP178 inhibitory action is within
the range of 1-10 ng/ml.

The next series of experiments involved the
preparation and testing of a DP178 (SEQ ID:1) homolog
for its ability to inhibit HIV-1-induced syncytia
formation. As shown in FIG. 1, the sequence of DP-185
(SEQ ID:3) is slightly different from DP178 (SEQ ID:1)
in that its primary sequence is taken from the HIV-1SF2
isolate and contains several amino acid differences
relative to DP178 (SEQ ID:1) near the N terminus. As
shown in FIG. 4, DP-185 (SEQ ID:3) exhibits
inhibitory activity even at 312.5 ng/ml, the lowest
concentration tested.

The next series of experiments involved a
comparison of DP178 (SEQ ID:1) HIV-1 and HIV-2
inhibitory activity. As shown in FIG. 5, DP178 (SEQ
ID:1) blocked HIV-1-mediated syncytia formation at
peptide concentrations below 1 ng/ml. DP178 (SEQ ID:1)
failed, however, to block HIV-2 mediated syncytia
formation at concentrations as high as 10 µg/ml. This
striking 4 log selectivity of DP178 (SEQ ID:1) as an
inhibitor of HIV-1-mediated cell fusion demonstrates
an unexpected HIV-1 specificity in the action of DP178
(SEQ ID:1). DP178 (SEQ ID:1) inhibition of HIV-1-
mediated cell fusion, together with the peptide's
inability to inhibit HIV-2 mediated cell fusion in the
same cell type at the concentrations tested, provides
further evidence for the high degree of selectivity
associated with the antiviral action of DP178 (SEQ ID:1).
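
The "4 log" figure follows directly from the two concentrations stated above, as this short sketch shows. The function name is hypothetical; the concentrations (10 µg/ml without effect on HIV-2, 1 ng/ml effective against HIV-1) are those given in the text, expressed in a common unit.

```python
import math


def log10_selectivity(inactive_conc_ng_ml: float, active_conc_ng_ml: float) -> float:
    """Orders of magnitude separating an inactive and an active concentration."""
    return math.log10(inactive_conc_ng_ml / active_conc_ng_ml)


HIV2_NO_EFFECT_NG_ML = 10_000.0  # 10 micrograms/ml, no block of HIV-2 fusion
HIV1_ACTIVE_NG_ML = 1.0          # HIV-1 fusion blocked at about this level

selectivity_logs = log10_selectivity(HIV2_NO_EFFECT_NG_ML, HIV1_ACTIVE_NG_ML)
print(f"selectivity: at least {selectivity_logs:.0f} logs")
```

Because HIV-2 fusion was not blocked at the highest concentration tested, the computed value is a lower bound on the selectivity.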

6.2.2. PEPTIDE INHIBITION OF INFECTION BY
CELL-FREE VIRUS

DP178 (SEQ ID:1) was next tested for its ability
to block CD-4+ CEM cell infection by cell free HIV-1
virus. The results, shown in FIG. 2, are from an
experiment in which DP178 (SEQ ID:1) was assayed for
its ability to block infection of CEM cells by an
HIV-1LAI isolate. Included in the experiment were
three control peptides, DP-116 (SEQ ID:9), DP-125 (SEQ
ID:8), and DP-118 (SEQ ID:10). DP-116 (SEQ ID:9)
represents a peptide previously shown to be inactive
using this assay, and DP-125 (SEQ ID:8; Wild, C. et
al., 1992, Proc. Natl. Acad. Sci. USA 89:10,537) and
DP-118 (SEQ ID:10) are peptides which have previously
been shown to be active in this assay. Each
concentration (0, 0.04, 0.4, 4, and 40 µg/ml) of
peptide was incubated with 247 TCID50 units of HIV-1LAI
virus and CEM cells. After 7 days of culture, cell-
free supernatant was tested for the presence of RT
activity as a measure of successful infection. The
results, shown in FIG. 2, demonstrate that DP178 (SEQ
ID:1) inhibited the de novo infection process mediated
by the HIV-1 viral isolate at concentrations as low as
90 ng/ml (IC50=90 ng/ml). In contrast, the two positive
control peptides, DP-125 (SEQ ID:8) and DP-118 (SEQ
ID:10), had over 60-fold higher IC50 concentrations of
approximately 5 µg/ml.

In a separate experiment, the HIV-1 and HIV-2
inhibitory action of DP178 (SEQ ID:1) was tested with
CEM cells and either HIV-1LAI or HIV-2NIHZ. 62 TCID50
HIV-1LAI or 25 TCID50 HIV-2NIHZ were used in these
experiments, and were incubated for 7 days. As may be
seen in FIG. 3, DP178 (SEQ ID:1) inhibited HIV-1
infection with an IC50 of about 31 ng/ml. In contrast,
DP178 (SEQ ID:1) exhibited a much higher IC50 for
HIV-2NIHZ, thus making DP178 (SEQ ID:1) two logs more
potent as an HIV-1 inhibitor than an HIV-2 inhibitor.
This finding is consistent with the results of the fusion
inhibition assays described, above, in Section 6.2.1,
and further supports a significant level of
selectivity (i.e., for HIV-1 over HIV-2).

7. EXAMPLE: THE HIV-1 INHIBITOR,
DP178 (SEQ ID:1) IS NON-CYTOTOXIC

In this Example, the 36 amino acid synthetic
peptide inhibitor DP178 (SEQ ID:1) is shown to be
non-cytotoxic to cells in culture, even at the highest
peptide concentrations (40 µg/ml) tested.

7.1. MATERIALS AND METHODS
Cell proliferation and toxicity assay: Approximately 3.8 x 10^5
CEM cells for each peptide concentration were incubated for 3 days
at 37°C in T25 flasks. Peptides tested were DP178 (SEQ ID:1) and
DP-116 (SEQ ID:9), as described in FIG. 1. Peptides were
synthesized as described, above, in Section 6.1. The
concentrations of each peptide used were 0, 2.5, 10, and 40 µg/ml.
Cell counts were taken at incubation times of 0, 24, 48, and 72
hours.

7.2. RESULTS
Whether the potent HIV-1 inhibitor DP178 (SEQ ID:1) exhibited any
cytotoxic effects was assessed by assaying the peptide's effects
on the proliferation and viability of cells in culture. CEM cells
were incubated in the presence of varying concentrations of DP178
(SEQ ID:1), and DP-116 (SEQ ID:9), a peptide previously shown to
be ineffective as an HIV inhibitor (Wild, C. et al., 1992, Proc.
Natl. Acad. Sci. USA 89:10,537-10,541). Additionally, cells were
incubated in the absence of either peptide.

The results of the cytotoxicity study demonstrate that DP178
(SEQ ID:1) exhibits no cytotoxic effects on cells in culture. As
can be seen, below, in Table VI, even the proliferation and
viability characteristics of cells cultured for 3 days in the
presence of the highest concentration of DP178 (SEQ ID:1) tested
(40 µg/ml) do not significantly differ from the DP-116 (SEQ ID:9)
or the no-peptide controls. The cell proliferation data are also
represented in graphic form in FIG. 6. As was demonstrated in the
Working Example presented above in Section 6, DP178 (SEQ ID:1)
completely inhibits HIV-1 mediated syncytia formation at peptide
concentrations between 1 and 10 ng/ml, and completely inhibits
cell-free viral infection at concentrations of at least 90 ng/ml.
Thus, this study demonstrates that even at peptide concentrations
greater than 3 log higher than the HIV inhibitory dose, DP178
(SEQ ID:1) exhibits no cytotoxic effects.
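
The margin between the highest concentration tested for
cytotoxicity and the inhibitory doses quoted above can be worked
out directly. The following is only an arithmetic sketch; the
function name and the labels are illustrative and are not part of
the original disclosure. It suggests that the "greater than 3 log"
margin corresponds to the syncytia-inhibiting doses (1 to
10 ng/ml), while the margin over the 90 ng/ml cell-free infection
dose is somewhat smaller.

```python
import math


def log10_margin(tested_ng_per_ml: float, inhibitory_ng_per_ml: float) -> float:
    """Return how many log10 units the tested dose lies above the inhibitory dose."""
    return math.log10(tested_ng_per_ml / inhibitory_ng_per_ml)


# Highest concentration tested for cytotoxicity: 40 ug/ml, expressed in ng/ml.
HIGHEST_TESTED_NG_PER_ML = 40_000.0

# Inhibitory doses quoted in the text (ng/ml); the labels are illustrative only.
INHIBITORY_DOSES = {
    "syncytia formation, lower bound": 1.0,
    "syncytia formation, upper bound": 10.0,
    "cell-free infection": 90.0,
}

for label, dose in INHIBITORY_DOSES.items():
    margin = log10_margin(HIGHEST_TESTED_NG_PER_ML, dose)
    print(f"{label}: {margin:.2f} log")
```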



Table VI

                                     % Viability at time (hours)
Peptide            Concentration
                   (µg/ml)           0      24     48     72

DP178              40                98     97     95     97
(SEQ ID:1)         10                98     97     98     98
                   2.5               98     93     96     96

DP116              40                98     95     98     97
(SEQ ID:9)         10                98     95     93     98
                   2.5               98     96     98     99

No Peptide         0                 98     97     99     98



8. EXAMPLE: THE INTERACTION OF DP178 AND DP107
Soluble recombinant forms of gp41 used in the example described
below provide evidence that the DP178 peptide associates with a
distal site on gp41 whose interactive structure is influenced by
the DP107 leucine zipper motif. A single mutation disrupting the
coiled-coil structure of the leucine zipper domain transformed the
soluble recombinant gp41 protein from an inactive to an active
inhibitor of HIV-1 fusion. This transformation may result from
liberation of the potent DP178 domain from a molecular clasp with
the leucine zipper, DP107, determinant. The results also indicate
that the anti-HIV activity of various gp41 derivatives (peptides
and recombinant proteins) may be due to their ability to form
complexes with viral gp41 and interfere with its fusogenic
process.

8.1. MATERIALS AND METHODS

8.1.1. CONSTRUCTION OF FUSION PROTEINS
AND GP41 MUTANTS

Construction of fusion proteins and mutants shown in FIG. 7 was
accomplished as follows: the DNA sequence corresponding to the
extracellular domain of gp41 (540-686) was cloned into the Xmn I
site of the expression vector pMal-p2 (New England Biolabs) to
give M41. The gp41 sequence was amplified from pgtat (Malim et
al., 1988, Nature 335:181-183) by using polymerase chain reaction
(PCR) with upstream primer 5'-ATGACGCTGACGGTACAGGCC-3' (primer A)
and downstream primer 5'-TGACTAAGCTTAATACCACAGCCAATTTGTTAT-3'
(primer B). M41-P was constructed by using the T7-Gen in vitro
mutagenesis kit from United States Biochemicals (USB) following
the supplier's instructions. The mutagenic primer
(5'-GGAGCTGCTTGGGGCCCCAGAC-3') introduces an Ile to Pro mutation
in M41 at position 578. M41Δ107, from which the DP107 region has
been deleted, was made using a deletion mutagenic primer
5'-CCAAATCCCCAGGAGCTGCTCGAGCTGCACTATACCAGAC-3' (primer C)
following the USB T7-Gen mutagenesis protocol. M41Δ178, from which
the DP178 region has been deleted, was made by cloning the DNA
fragment corresponding to gp41 amino acids 540-642 into the Xmn I
site of pMal-p2. Primer A and
5'-ATAGCTTCTAGATTAATTGTTAATTTCTCTGTCCC-3' (primer D) were used in
the PCR with the template pgtat to generate the inserted DNA
fragments. M41-P was used as the template with primers A and D in
PCR to generate M41-PΔ178. All inserted sequences and mutated
residues were checked by restriction enzyme analysis and confirmed
by DNA sequencing.
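
The primer sequences listed above can be sanity-checked with a few
lines of code. The following is a minimal sketch, not part of the
original protocol: the helper names are illustrative, and the
choice of enzymes to screen for (HindIII, XbaI, XhoI) is an
assumption, since the text does not name the restriction sites
carried by the primers. The recognition sequences themselves are
the standard ones for those enzymes.

```python
# Primer sequences as given in the text, written 5' to 3'.
PRIMERS = {
    "A": "ATGACGCTGACGGTACAGGCC",
    "B": "TGACTAAGCTTAATACCACAGCCAATTTGTTAT",
    "C": "CCAAATCCCCAGGAGCTGCTCGAGCTGCACTATACCAGAC",
    "D": "ATAGCTTCTAGATTAATTGTTAATTTCTCTGTCCC",
    "M41-P mutagenic": "GGAGCTGCTTGGGGCCCCAGAC",
}

# Assumption: these enzymes are screened for illustration only.
RESTRICTION_SITES = {
    "HindIII": "AAGCTT",
    "XbaI": "TCTAGA",
    "XhoI": "CTCGAG",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def sites_in(seq: str) -> list[str]:
    """Return the names of the screened enzymes whose site occurs in seq."""
    return sorted(name for name, site in RESTRICTION_SITES.items() if site in seq)


for name, seq in PRIMERS.items():
    print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.0%}, sites {sites_in(seq)}")
```

Because all three screened recognition sequences are palindromic,
searching one strand is sufficient.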

8.1.2. PURIFICATION AND CHARACTERIZATION
OF FUSION PROTEINS

The fusion proteins were purified according to the protocol
described in the manufacturer's brochure of protein fusion and
purification systems from New England Biolabs (NEB). Fusion
proteins (10 ng) were analyzed by electrophoresis on 8% SDS
polyacrylamide gels. Western blotting analysis was performed as
described by Sambrook et al., 1989, Molecular Cloning: A
Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press,
Cold Spring Harbor, NY, Ch. 18, pp. 64-75. An HIV-1 positive serum
diluted 1000-fold, or a human Fab derived from repertoire cloning,
was used to react with the fusion proteins. The second antibody
was HRP-conjugated goat anti-human Fab. An ECL Western blotting
detection system (Amersham) was used to detect the bound antibody.
A detailed protocol for this detection system was provided by the
manufacturer. Rainbow molecular weight markers (Amersham) were
used to estimate the size of fusion proteins.

8.1.3. CELL FUSION ASSAYS FOR ANTI-HIV ACTIVITY
Cell fusion assays were performed as previously described
(Matthews et al., 1987, Proc. Natl. Acad. Sci. USA 84:5424-5428).
CEM cells (7 x 10^4) were incubated with HIV-1IIIB chronically
infected CEM cells (10^4) in 96-well flat-bottomed half-area
plates (Costar) in 100 µl culture medium. Peptide and fusion
proteins at various concentrations in 10 µl culture medium were
incubated with the cell mixtures at 37°C for 24 hours.
Multinucleated syncytia were estimated by microscopic examination.
Neither M41 nor M41-P showed cytotoxicity at the concentrations
tested and shown in FIG. 8.

Inhibition of HIV-1 induced cell-cell fusion activity was carried
out in the presence of 10 nM DP178 and various concentrations of
M41Δ178 or M41-PΔ178 as indicated in FIG. 9. No syncytia were
observable in the presence of 10 nM DP178. No peptide or fusion
protein was added in the control samples.

8.1.4. ELISA ANALYSIS OF DP178 BINDING
TO THE LEUCINE ZIPPER MOTIF OF GP41

The amino acid sequence of DP178 used is:
YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF. For enzyme linked
immunoassay (ELISA), M41Δ178 or M41-PΔ178 (5 µg/ml) in 0.1 M
NaHCO3, pH 8.6, were coated on 96-well Linbro ELISA plates (Flow
Lab, Inc.) overnight. Each well was washed three times with
distilled water then blocked with 3% bovine serum albumin (BSA)
for 2 hours. After blocking, peptides with 0.5% BSA in TBST (40 mM
Tris-HCl pH 7.5, 150 mM NaCl, 0.05% Tween 20) were added to the
ELISA plates and incubated at room temperature for 1 hour. After
washing three times with TBST, Fab-d was added at a concentration
of 10 ng/ml with 0.5% BSA in TBST. The plates were washed three
times with TBST after incubation at room temperature for 1 hour.
Horseradish peroxidase (HRP) conjugated goat anti-human Fab
antiserum at a 2000-fold dilution in TBST with 0.5% BSA was added
to each well and incubated at room temperature for 45 minutes. The
plates were then washed four times with TBST. The peroxidase
substrate o-phenylene diamine (2.5 mg/ml) and 0.15% H2O2 were
added to develop the color. The reaction was stopped with an equal
volume of 4.5 N H2SO4 after incubation at room temperature for 10
minutes. The optical density of the stopped reaction mixture was
measured with a micro plate reader (Molecular Design) at 490 nm.
Results are shown in FIG. 10.

8.2. RESULTS

8.2.1. THE EXPRESSION AND CHARACTERIZATION
OF THE ECTODOMAIN OF GP41

As a step toward understanding the roles of the two helical
regions in gp41 structure and function, the ectodomain of gp41 was
expressed as a maltose binding fusion protein (M41) (FIG. 7). The
fusogenic peptide sequence at the N-terminus of gp41 was omitted
from this recombinant protein and its derivatives to improve
solubility. The maltose binding protein facilitated purification
of the fusion proteins under relatively mild, non-denaturing
conditions. Because the M41 soluble recombinant gp41 was not
glycosylated, lacked several regions of the transmembrane protein
(i.e., the fusion peptide, the membrane spanning, and the
cytoplasmic domains), and was expressed in the absence of gp120,
it was not expected to precisely reflect the structure of native
gp41 on HIV-1 virions. Nevertheless, purified M41 folded in a
manner that preserved certain discontinuous epitopes as evidenced
by reactivity with human monoclonal antibodies, 98-6, 126-6, and
50-69, previously shown to bind conformational epitopes on native
gp41 expressed in eukaryotic cells (Xu et al., 1991, J. Virol.
65:4832-4838; Chen, 1994, J. Virol. 68:2002-2010). Thus, at least
certain regions of native gp41 defined by these antibodies appear
to be reproduced in the recombinant fusion protein M41.
Furthermore, M41 reacted with a human recombinant Fab (Fab-d) that
recognizes a conformational epitope on gp41 and binds HIV-1
virions as well as HIV-1 infected cells but not uninfected cells
as analyzed by FACS. Deletion of either helix motif, i.e., DP107
or DP178, of the M41 fusion protein eliminated reactivity with
Fab-d. These results indicate that both helical regions, separated
by 60 amino acids in the primary sequence, are required to
maintain the Fab-d epitope.

8.2.2. ANTI-HIV ACTIVITY OF THE
RECOMBINANT ECTODOMAIN OF GP41

The wild type M41 fusion protein was tested for anti-HIV-1
activity. As explained, supra, synthetic peptides corresponding to
the leucine zipper (DP107) and the C-terminal putative helix
(DP178) show potent anti-HIV activity. Despite inclusion of both
these regions, the recombinant M41 protein did not affect HIV-1
induced membrane fusion at concentrations as high as 50 µM
(Table VII, below).

Table VII

DISRUPTION OF THE LEUCINE ZIPPER OF
GP41 FREES THE ANTI-HIV MOTIF

                         DP107   DP178   M41          M41-P        M41-PΔ178

Cell fusion (IC90)       1 µM    1 nM    >50 µM       83 nM        >50 µM

Fab-d binding (kD)       -       -       3.5 x 10^-9  2.5 x 10^-8

HIV infectivity (IC90)   1 µM    80 nM   >16 µM       66 nM        >8 µM

1 The affinity constants of Fab-d binding to the fusion proteins
were determined using a protocol described by B. Friguet et al.,
1985, J. Immunol. Method. 77:305-319.

- = No detectable binding of Fab-d to the fusion proteins.

Antiviral Infectivity Assays. 20 µl of serially diluted virus
stock was incubated for 60 minutes at ambient temperature with
20 µl of the indicated concentration of purified recombinant
fusion protein in RPMI 1640 containing 10% fetal bovine serum and
antibiotics in a 96-well microtiter plate. 20 µl of CEM4 cells at
6 x 10^5 cells/ml were added to each well, and cultures were
incubated at 37°C in a humidified CO2 incubator. Cells were
cultured for 9 days by the addition of fresh medium every 2 to 3
days. On days 5, 7, and 9 postinfection, supernatant samples were
assayed for reverse transcriptase (RT) activity, as described
below, to monitor viral replication. The 50% tissue culture
infectious dose (TCID50) was calculated for each condition
according to the formula of Reed & Muench, 1937, Am. J. Hyg.
27:493-497. RT activity was determined by a modification of the
published methods of Goff et al., 1981, J. Virol. 38:239-248 and
Willey et al., 1988, J. Virol. 62:139-147 as described in Chen et
al., 1993, AIDS Res. Human Retroviruses 9:1079-1086.
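
For readers unfamiliar with the Reed & Muench calculation cited in
the note to Table VII, the following is a minimal sketch of the
commonly used form of that endpoint method. It is not the authors'
own implementation; the function name and the well counts are
hypothetical and serve only to illustrate the arithmetic.

```python
def reed_muench_log10_endpoint(log10_dilutions, infected, totals):
    """Estimate the log10 dilution at which 50% of cultures are infected.

    Rows must be ordered from the most concentrated to the most dilute
    virus dilution, and every dilution must have at least one culture.
    """
    uninfected = [t - i for i, t in zip(infected, totals)]
    n = len(infected)
    # Infected cultures accumulate from the most dilute row upward;
    # uninfected cultures accumulate from the most concentrated row downward.
    cum_infected = [sum(infected[k:]) for k in range(n)]
    cum_uninfected = [sum(uninfected[: k + 1]) for k in range(n)]
    percent = [
        100.0 * ci / (ci + cu) for ci, cu in zip(cum_infected, cum_uninfected)
    ]
    for k in range(n - 1):
        if percent[k] >= 50.0 > percent[k + 1]:
            # Proportionate distance between the two bracketing dilutions.
            distance = (percent[k] - 50.0) / (percent[k] - percent[k + 1])
            step = log10_dilutions[k + 1] - log10_dilutions[k]
            return log10_dilutions[k] + distance * step
    raise ValueError("the 50% endpoint is not bracketed by the tested dilutions")


# Hypothetical example: ten-fold dilutions, five cultures per dilution.
endpoint = reed_muench_log10_endpoint(
    log10_dilutions=[-2, -3, -4, -5],
    infected=[5, 4, 2, 0],
    totals=[5, 5, 5, 5],
)
print(f"log10 of the 50% endpoint dilution: {endpoint:.2f}")
```

The reciprocal of the endpoint dilution gives the number of TCID50
units contained in the inoculum volume used per culture.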



Surprisingly, a single amino acid substitution, proline in place
of isoleucine in the middle of the leucine zipper motif, yielded a
fusion protein (M41-P) which did exhibit antiviral activity
(Table XXV and FIG. 8). As seen in Table XXV, M41-P blocked
syncytia formation by 90% at approximately 85 nM and neutralized
HIV-1IIIB infection by 90% at approximately 70 nM concentrations.
The anti-HIV-1 activity of M41-P appeared to be mediated by the
C-terminal helical sequence since deletion of that region from
M41-P yielded an inactive fusion protein, M41-PΔ178 (Table XXV).
This interpretation was reinforced by experiments demonstrating
that a truncated fusion protein lacking the DP178 sequence,
M41Δ178, abrogated the potent anti-fusion activity of the DP178
peptide in a concentration-dependent manner (FIG. 9). The same
truncated fusion protein containing the proline mutation
disrupting the leucine zipper, M41-PΔ178, was not active in
similar competition experiments (FIG. 9). The results indicate
that the DP178 peptide associates with a second site on gp41 whose
interactive structure is dependent on a wild type leucine zipper
sequence. A similar interaction may occur within the wild type
fusion protein, M41, and act to form an intramolecular clasp which
sequesters the DP178 region, making it unavailable for anti-viral
activity.

A specific association between these two domains is also indicated
by other human monoclonal Fab-d studies. For example, Fab-d failed
to bind either the DP178 peptide or the fusion protein M41Δ178,
but its epitope was reconstituted by simply mixing these two
reagents together (FIG. 10). Again, the proline mutation in the
leucine zipper domain of the fusion protein, M41-PΔ178, failed to
reconstitute the epitope in similar mixing experiments.

9. EXAMPLE: METHOD FOR COMPUTER-ASSISTED
IDENTIFICATION OF DP107-LIKE
AND DP178-LIKE SEQUENCES

A number of known coiled-coil sequences have been well described
in the literature and contain heptad repeat positioning for each
amino acid. Coiled-coil nomenclature labels each of seven amino
acids of a heptad repeat A through G, with amino acids A and D
tending to be hydrophobic positions. Amino acids E and G tend to
be charged. These four positions (A, D, E, and G) form the
amphipathic backbone structure of a monomeric alpha-helix. The
backbones of two or more amphipathic helices interact with each
other to form di-, tri-, tetrameric, etc., coiled-coil structures.
In order to begin to design computer search motifs, a series of
well characterized coiled coils was chosen, including yeast
transcription factor GCN4, Influenza Virus hemagglutinin loop 36,
and human proto-oncogenes c-Myc, c-Fos, and c-Jun. For each
peptide sequence, a strict homology for the A and D positions, and
a list of the amino acids which could be excluded for the B, C, E,
F, and G positions (because they are not observed in these
positions) was determined. Motifs were tailored to the DP107 and
DP178 sequences by deducing the most likely possibilities for
heptad positioning of the amino acids of HIV-1 Bru DP107, which is
known to have coiled-coil structure, and HIV-1 Bru DP178, which is
still structurally undefined. The analysis of each of the
sequences is contained in FIG. 12. For example, the motif for GCN4
was designed as follows:

1. The only amino acids (using standard single letter amino acid
   codes) found in the A or D positions of GCN4 were [LMNV].

2. All amino acids were found at B, C, E, F, and G positions
   except {CFGIMPTW}.

3. The PESEARCH motif would, therefore, be written as follows:

   [LMNV]-{CFGIMPTW}(2)-[LMNV]-{CFGIMPTW}(3)-
   [LMNV]-{CFGIMPTW}(2)-[LMNV]-{CFGIMPTW}(3)-
   [LMNV]-{CFGIMPTW}(2)-[LMNV]-{CFGIMPTW}(3)-
   [LMNV]-{CFGIMPTW}(2)-[LMNV]-{CFGIMPTW}(3)
Translating or reading the motif: "at the first A position either
L, M, N, or V must occur; at positions B and C (the next two
positions) accept everything except C, F, G, I, M, P, T, or W; at
the D position either L, M, N, or V must occur; at positions E, F,
and G (the next 3 positions) accept everything except C, F, G, I,
M, P, T, or W." This statement is contained four times in a 28-mer
motif and five times in a 35-mer motif. The basic motif key then
would be: [LMNV]-{CFGIMPTW}. The motif keys for the remaining well
described coiled-coil sequences are summarized in FIG. 12.
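
The reading of a motif key given above maps naturally onto a
regular expression. The following is a minimal sketch of such a
translation; it is not the PC/Gene PESEARCH program itself, the
function names are illustrative, and the test sequence is
synthetic rather than a real protein. It assumes the sequence is
written in uppercase single letter codes.

```python
import re


def motif_key_to_regex(allowed_ad: str, excluded_other: str, heptads: int) -> str:
    """Expand a motif key such as [LMNV]-{CFGIMPTW} over a number of heptads.

    allowed_ad     -- residues required at the A and D positions
    excluded_other -- residues rejected at the B, C, E, F, and G positions
    heptads        -- 4 for a 28-mer motif, 5 for a 35-mer motif
    """
    a_or_d = f"[{allowed_ad}]"
    other = f"[^{excluded_other}]"
    # One heptad: A, then B and C, then D, then E, F, and G.
    heptad = f"{a_or_d}{other}{{2}}{a_or_d}{other}{{3}}"
    return f"(?:{heptad}){{{heptads}}}"


def find_hits(sequence: str, pattern: str) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) positions of all hits.

    A lookahead is used so that overlapping hits are also reported.
    """
    scanner = re.compile(f"(?=({pattern}))")
    return [(m.start(1) + 1, m.end(1)) for m in scanner.finditer(sequence)]


gcn4_28mer = motif_key_to_regex("LMNV", "CFGIMPTW", heptads=4)
print(gcn4_28mer)

# Synthetic sequence: one flanking residue on each side of four ideal heptads.
synthetic = "G" + "LAAVAAA" * 4 + "G"
print(find_hits(synthetic, gcn4_28mer))
```

Changing `heptads` from 4 to 5 yields the 35-mer form of the same
motif key.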

The motif design for DP107 and DP178 was slightly different from
the 28-mer model sequences described above because heptad repeat
positions are not defined and the peptides are both longer than 28
residues. FIG. 13 illustrates several possible sequence alignments
for both DP107 and DP178 and also includes motif designs based on
28-mer, 35-mer, and full-length peptides. Notice that only slight
differences occur in the motifs as the peptides are lengthened.
Generally, lengthening the base peptide results in a less
stringent motif. This is very useful in broadening the
possibilities for identifying DP107- or DP178-like primary amino
acid sequences, referred to in this document as "hits".

In addition to making highly specific motifs for each type of
peptide sequence to be searched, it is also possible to make
"hybrid" motifs. These motifs are made by "crossing" two or more
very stringent motifs to make a new search algorithm which will
find not only both "parent" motif sequences but also any peptide
sequences which have similarities to one, the other, or both
"parents". For example, in FIG. 14 the "parent" sequence of GCN4
is crossed with each of the possible "parent" motifs of DP107. Now
the hybrid motif must contain all of the amino acids found in the
A and D positions of both parents, and exclude only the amino
acids found in neither parent at the other positions. The
resulting hybrid from crossing GCN4, or [LMNV]{CFGIMPTW}, and
DP107 (28-mer with the first L in the D position), or
[ILQT]{CDFIMPST}, is [ILMNQTV]{CFIMPT}. Notice that now only two
basic hybrid motifs exist which cover both framing possibilities,
as well as all peptide lengths of the parent DP107 molecule.
FIG. 15 represents the "hybridizations" of GCN4 with DP178.
FIG. 16 represents the "hybridizations" of DP107 and DP178. It is
important to keep in mind that the represented motifs, both parent
and hybrid, are motif keys and not the depiction of the
full-length motif needed to actually do the computer search.
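
The crossing rule described above reduces to two set operations: a
union over the residues allowed at the A and D positions, and an
intersection over the residues excluded at the other positions.
The following is a minimal sketch of that rule, with illustrative
function names that do not come from the original disclosure. It
reproduces the GCN4 x DP107 hybrid key quoted in the text.

```python
from functools import reduce

# A motif key is represented as (allowed at A/D, excluded at B, C, E, F, G).
MotifKey = tuple[str, str]

GCN4: MotifKey = ("LMNV", "CFGIMPTW")
DP107_28MER_FIRST_L_AT_D: MotifKey = ("ILQT", "CDFIMPST")


def hybridize(*parents: MotifKey) -> MotifKey:
    """Cross two or more motif keys into a less stringent hybrid key."""
    allowed = reduce(set.union, (set(a) for a, _ in parents))
    excluded = reduce(set.intersection, (set(e) for _, e in parents))
    return "".join(sorted(allowed)), "".join(sorted(excluded))


def is_subset_of(narrow: MotifKey, broad: MotifKey) -> bool:
    """True if every sequence matched by `narrow` is also matched by `broad`."""
    return set(narrow[0]) <= set(broad[0]) and set(narrow[1]) >= set(broad[1])


hybrid = hybridize(GCN4, DP107_28MER_FIRST_L_AT_D)
print("[{}]{{{}}}".format(*hybrid))
```

Under this representation a parent key is always a subset of any
hybrid made from it, which is why hybrids are less stringent than
their parents.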

Hybridizations can be performed on any combination of two or more
motifs. FIG. 17 summarizes several three-motif hybridizations
including GCN4, DP107 (both frames), and DP178 (also both frames).
Notice that the resulting motifs are now becoming much more
similar to each other. In fact, the first and third hybrid motifs
are actually subsets of the second and fourth hybrid motifs
respectively. This means that the first and third hybrid motifs
are slightly more stringent than the second and fourth. It should
also be noted that with only minor changes in these four motifs,
or by hybridizing them, a single motif could be obtained which
would find all of the sequences. However, it should be remembered
that stringency is also reduced. Finally, the most broad-spectrum
and least-stringent hybrid motif is described in FIG. 18, which
summarizes the hybridization of GCN4, DP107 (both frames), DP178
(both frames), c-Fos, c-Jun, c-Myc, and Flu loop 36.

A special set of motifs was designed based on the fact that DP178
is located only approximately ten amino acids upstream of the
transmembrane spanning region of gp41 and just C-terminal to a
proline which separates DP107 and DP178. It has been postulated
that DP178 may be an amphipathic helix when membrane associated,
and that the proline might aid in the initiation of the helix
formation. The same arrangement was observed in Respiratory
Syncytial Virus; however, the DP178-like region in this virus also
had a leucine zipper just C-terminal to the proline. Therefore,
N-terminal proline-leucine zipper motifs were designed to analyze
whether any other viruses might contain this same pattern. The
motifs are summarized in FIG. 19.

The PC/Gene protein database contains 5879 viral amino acid
sequences (library file PVIRUSES; CD-ROM release 11.0). Of these,
1092 are viral envelope or glycoprotein sequences (library file
PVIRUSE1). Tables V through XIV contain lists of protein sequence
names and motif hit locations for all the motifs searched.

10. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION
OF DP107 AND DP178-LIKE SEQUENCES
IN HUMAN IMMUNODEFICIENCY VIRUS

FIG. 20 represents search results for HIV-1 BRU isolate gp41
(PC/Gene protein sequence PENV_HV1BR). Notice that the hybrid
motif which crosses DP107 and DP178 (named 107x178x4; the same
motif as found in FIG. 16) found three hits including amino acids
550-599, 636-688, and 796-823. These areas include DP107 plus
eight N-terminal and four C-terminal amino acids; DP178 plus seven
N-terminal and ten C-terminal amino acids; and an area inside the
transmembrane region (cytoplasmic). FIG. 20 also contains the
results obtained from searching with the motif named ALLMOTI5, for
which the key is found in FIG. 17 ({CDGHP}{CFP}x5). This motif
also found three hits including DP107 (amino acids 510-599), DP178
(615-717), and a cytoplasmic region (772-841). These hits overlap
the hits found by the motif 107x178x4 with considerable additional
sequences on both the amino and carboxy termini. This is not
surprising in that 107x178x4 is a subset of the ALLMOTI5 hybrid
motif. Importantly, even though the stringency of ALLMOTI5 is
considerably less than that of 107x178x4, it still selectively
identifies the DP107 and DP178 regions of gp41 shown to contain
sequences for inhibitory peptides of HIV-1. The results of these
two motif searches are summarized in Table V of U.S. Patent
Application Serial No. 08/470,896 filed on June 6, 1995
(incorporated herein by reference in its entirety) under the
PC/Gene protein sequence name PENV_HV1BR. The proline-leucine
zipper motifs also gave several hits in HIV-1 BRU including
503-525, which is at the very C-terminus of gp120, just upstream
of the cleavage site (P7LZIPC and P12LZIPC); and 735-768 in the
cytoplasmic domain of gp41 (P23LZIPC). These results are found in
Tables VIII, IX, and X under the same sequence name as mentioned
above. Notice that the only area of HIV-1 BRU which is predicted
by the Lupas algorithm to contain a coiled-coil region is from
amino acids 635-670. This begins eight amino acids N-terminal to
the start and ends eight amino acids N-terminal to the end of
DP178. DP107, despite the fact that it is a known coiled coil, is
not predicted to contain a coiled-coil region using the Lupas
method.
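
The statement that the ALLMOTI5 hits overlap the 107x178x4 hits
with additional sequence on both termini can be checked from the
coordinates quoted above. The following is a small sketch using
only those reported positions; the variable and function names are
illustrative and not part of the original disclosure.

```python
# Hit coordinates in HIV-1 BRU gp41 as reported in the text (1-based, inclusive).
HITS_107X178X4 = [(550, 599), (636, 688), (796, 823)]
HITS_ALLMOTI5 = [(510, 599), (615, 717), (772, 841)]


def contains(outer: tuple[int, int], inner: tuple[int, int]) -> bool:
    """True if the inner interval lies entirely within the outer interval."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


def extra_flanks(outer: tuple[int, int], inner: tuple[int, int]) -> tuple[int, int]:
    """Return the extra residues the outer hit adds on the N and C termini."""
    return inner[0] - outer[0], outer[1] - inner[1]


for broad, narrow in zip(HITS_ALLMOTI5, HITS_107X178X4):
    n_extra, c_extra = extra_flanks(broad, narrow)
    print(
        f"{narrow} within {broad}: {contains(broad, narrow)} "
        f"(+{n_extra} N-terminal, +{c_extra} C-terminal)"
    )
```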

11. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION
OF DP107-LIKE AND DP178-LIKE
SEQUENCES IN HUMAN RESPIRATORY
SYNCYTIAL VIRUS

FIG. 21 represents search results for Human Respiratory Syncytial
Virus (RSV; Strain A2) fusion glycoprotein F1 (PC/Gene protein
sequence name PVGLF_HRSVA). Motif 107x178x4 finds three hits
including amino acids 152-202, 213-243, and 488-515. The
arrangement of these hits is similar to what is found in HIV-1
except that the motif finds two regions with similarities to
DP178, one just downstream of what would be called the DP107
region, or amino acids 213-243, and one just upstream of the
transmembrane region (also similar to DP178), or amino acids
488-515. Motif ALLMOTI5 also finds three areas including amino
acids 116-202, 267-302, and 506-549. The proline-leucine zipper
motifs also gave several hits including amino acids 205-221 and
265-287 (P1LZIPC 265-280, P12LZIPC), and 484-513 (P7LZIPC and
P12LZIPC 484-506, P23LZIPC). Notice that the PLZIP motifs also
identify regions which share location similarities with DP178 of
HIV-1.

12. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP107-LIKE AND DP178-LIKE SEQUENCES
IN SIMIAN IMMUNODEFICIENCY VIRUS

Motif hits for Simian Immunodeficiency Virus gp41 (AGM3 isolate;
PC/Gene protein sequence name PENV_SIVAG) are shown in FIG. 22.
Motif 107x178x4 finds three hits including amino acids 566-593,
597-624, and 703-730. The first two hits only have three amino
acids between them and could probably be combined into one hit
from 566-624 which would represent a DP107-like hit. Amino acids
703 to 730 would then represent a DP178-like hit. ALLMOTI5 also
finds three hits including amino acids 556-628 (DP107-like),
651-699 (DP178-like), and 808-852, which represents the
transmembrane spanning region. SIV also has one region from
655-692 with a high propensity to form a coiled coil as predicted
by the Lupas algorithm. Both 107x178x4 and ALLMOTI5 motifs find
the same region. SIV does not have any PLZIP motif hits in gp41.

The identification of DP178/DP107 analogs for a second SIV isolate
(MM251) is demonstrated in the Example presented, below, in
Section 19.



13. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP107-LIKE AND DP178-LIKE SEQUENCES
IN CANINE DISTEMPER VIRUS

Canine Distemper Virus (strain Onderstepoort) fusion glycoprotein
F1 (PC/Gene protein sequence name PVGLF_CDVO) has regions similar
to Human RSV which are predicted to be DP107-like and DP178-like
(FIG. 23). Motif 107x178x4 highlights one area just C-terminal to
the fusion peptide at amino acids 252-293. Amino acids 252-286 are
also predicted to be coiled coil using the Lupas algorithm. Almost
100 amino acids C-terminal to the first region is a DP178-like
area at residues 340-367. ALLMOTI5 highlights three areas of
interest including: amino acids 228-297, which completely overlaps
both the Lupas prediction and the DP107-like 107x178x4 hit;
residues 340-381, which overlaps the second 107x178x4 hit; and
amino acids 568-602, which is DP178-like in that it is located
just N-terminal to the transmembrane region. It also overlaps
another region (residues 570-602) predicted by the Lupas method to
have a high propensity to form a coiled coil. Several PLZIP motifs
successfully identified areas of interest including P6 and
P12LZIPC, which highlight residues 336-357 and 336-361
respectively; P1 and P12LZIPC, which find residues 398-414; and
P12 and P23LZIPC, which find residues 562-589 and 562-592
respectively.



14. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP107-LIKE AND DP178-LIKE SEQUENCES
IN NEWCASTLE DISEASE VIRUS

FIG. 24 shows the motif hits found in Newcastle Disease Virus
(strain Australia-Victoria/32; PC/Gene protein sequence name
PVGLF_NDVA). Motif 107x178x4 finds two areas including a
DP107-like hit at amino acids 151-178 and a DP178-like hit at
residues 426-512. ALLMOTI5 finds three areas including residues
117-182, 231-272, and 426-512. The hits from 426-512 include a
region which is predicted by the Lupas method to have a high
coiled-coil propensity (460-503). The PLZIP motifs identify only
one region of interest at amino acids 273-289 (P1 and P12LZIPC).

15. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION
OF DP107-LIKE AND DP178-LIKE
SEQUENCES IN HUMAN PARAINFLUENZA VIRUS

Both motifs 107x178x4 and ALLMOTI5 exhibit DP107-like hits in the
same region, 115-182 and 117-182 respectively, of Human
Parainfluenza Virus (strain NIH 47885; PC/Gene protein sequence
name PVGLF_PI3H4; FIG. 25). In addition, the two motifs have a
DP178-like hit just slightly C-terminal at amino acids 207-241.
Both motifs also have DP178-like hits nearer the transmembrane
region including amino acids 457-497 and 462-512 respectively.
Several PLZIP motif hits are also observed including 283-303
(P5LZIPC), 283-310 (P12LZIPC), 453-474 (P6LZIPC), and 453-481
(P23LZIPC). The Lupas algorithm predicts that amino acids 122-176
may have a propensity to form a coiled coil.

16. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP107-LIKE AND DP178-LIKE SEQUENCES OF
INFLUENZA A VIRUS

FIG. 26 illustrates the Lupas prediction for a coiled coil in
Influenza A Virus (strain A/Aichi/2/68) at residues 379-436, as
well as the motif hits for 107x178x4 at amino acids 387-453, and
for ALLMOTI5 at residues 380-456. Residues 383-471 (38-125 of HA2)
were shown by Carr and Kim to be an extended coiled coil when
under acidic pH (Carr and Kim, 1993, Cell 73:823-832). The Lupas
algorithm predicts a coiled coil at residues 379-436. All three
methods successfully predicted the region shown to actually have
coiled-coil structure; however, ALLMOTI5 predicted the greatest
portion of the 88 residue stretch.

17. EXAMPLE: POTENTIAL RESPIRATORY SYNCYTIAL VIRUS
DP178/DP107 ANALOGS: CD AND
ANTIVIRAL CHARACTERIZATION

In the Example presented herein, respiratory syncytial virus (RSV)
peptides identified by utilizing the computer-assisted search
motifs described in the Examples presented in Sections 9 and 11,
above, were tested for anti-RSV activity. Additionally, circular
dichroism (CD) structural analyses were conducted on the peptides,
as discussed below. It is demonstrated that several of the
identified peptides exhibit potent antiviral capability.
Additionally, it is shown that several of these peptides exhibit a
substantial helical character.

17.1 MATERIALS AND METHODS 
Structural analyses : The CD spectra were 
measured in a lOmM sodium phosphate, 150mM sodium 

2 0 

chloride, pH 7.0, buffer at approximately lOmM 
concentrations, using a 1 cm pathlength cell on a 
Jobin/Yvon Autodichrograph Mark V CD 
spectrophotometer. Peptides were synthesized 
according to the methods described, above, in Section 

25 6.1. Peptide concentrations were determined from A^o 
using Edlehoch's method (1967, Biochemistry 6: 1948) . 

Anti-RSV antiviral activity assays: The assay
utilized herein tested the ability of the peptides to
disrupt the ability of Hep-2 cells acutely infected
with RSV (i.e., cells which are infected with a
multiplicity of infection of greater than 2) to fuse
and cause syncytial formation on a monolayer of
an uninfected line of Hep-2 cells. The
lower the observed level of fusion, the greater the
antiviral activity of the peptide was determined to
be.
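
The relationship between observed fusion and antiviral activity can be expressed, for illustration, as a percent reduction in syncytia relative to a well containing no peptide. This is a minimal sketch under that assumption; the function name and the example counts are hypothetical.

```python
def percent_inhibition(syncytia_with_peptide, syncytia_no_peptide):
    """Percent reduction in syncytia relative to a no-peptide control well."""
    if syncytia_no_peptide <= 0:
        raise ValueError("the control well must contain at least one syncytium")
    return 100.0 * (1.0 - syncytia_with_peptide / syncytia_no_peptide)


# Hypothetical counts: 80 syncytia without peptide, 20 with peptide.
print(percent_inhibition(20, 80))
```

A value near 100 indicates nearly complete suppression of fusion, while a value near 0 indicates no detectable effect of the peptide.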

Uninfected confluent monolayers of Hep-2 cells
were grown in microtiter wells in 3% EMEM (Eagle
Minimum Essential Medium w/o L-glutamine [Bio
Whittaker Cat. No. 12-125F], with fetal bovine serum
[FBS; which had been heat inactivated for 30 minutes
at 56°C; Bio Whittaker Cat. No. 14-501F] supplemented
at 3%, antibiotics (penicillin/streptomycin; Bio
Whittaker Cat. No. 17-602E) added at 1%, and glutamine
added at 1%).

To prepare Hep-2 cells for addition to uninfected
cells, cultures of acutely infected Hep-2 cells were
washed with DPBS (Dulbecco's Phosphate Buffered Saline
w/o calcium or magnesium; Bio Whittaker Cat. No.
17-512F) and cell monolayers were removed with Versene
(1:5000; Gibco Life Technologies Cat. No. 15040-017).
The cells were spun 10 minutes and resuspended in 3%
FBS. Cell counts were performed using a
hemacytometer. Persistent cells were added to the
uninfected Hep-2 cells.

The antiviral assay was conducted by, first,
removing all media from the wells containing
uninfected Hep-2 cells, then adding peptides (at the
dilutions described below) in 3% EMEM, and 100 acutely
RSV-infected Hep-2 cells per well. Wells were then
incubated at 37°C for 48 hours.

After incubation, cells in control wells were
checked for fusion centers, media was removed from the
wells, followed by addition, to each well, of either
Crystal Violet stain or XTT. With respect to Crystal
Violet, approximately 50 µl of 0.25% Crystal Violet stain
in methanol was added to each well. The wells were
rinsed immediately, to remove excess stain, and were
allowed to dry. The number of syncytia per well was
then counted, using a dissecting microscope.
With respect to XTT (2,3-bis[2-methoxy-4-nitro-5-
sulfophenyl]-2H-tetrazolium-5-carboxyanilide inner
salt), 50 µl XTT (1 mg/ml in RPMI buffered with 100 mM
HEPES, pH 7.2-7.4, plus 5% DMSO) was added to each
well. The OD450/690 was measured (after blanking against
growth medium without cells or reagents, and against
reagents) according to standard procedures.
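
The text does not spell out the arithmetic behind the OD450/690 reading. One common interpretation, assumed here purely for illustration, is that the absorbance at the 690 nm reference wavelength is subtracted from the absorbance at 450 nm, and that the same quantity measured in a blank well is then subtracted. The function name and the example readings below are hypothetical.

```python
def corrected_od(a450, a690, blank_a450=0.0, blank_a690=0.0):
    """Reference-corrected, blank-subtracted absorbance (assumed scheme).

    The 690 nm reading serves as a reference wavelength; the blank well
    contains medium and reagents but no cells.
    """
    sample = a450 - a690
    blank = blank_a450 - blank_a690
    return sample - blank


# Hypothetical plate-reader values for one test well and one blank well.
print(round(corrected_od(0.85, 0.05, blank_a450=0.12, blank_a690=0.04), 3))
```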

Peptides: The peptides characterized in the
study presented herein were:

1) peptides T-142 to T-155 and T-575, as shown in FIG.
27A, and peptides T-22 to T-27, T-68, T-334 and T-371
to T-375 and T-575, as shown in FIG. 27B;

2) peptides T-120 to T-141 and T-576, as shown in FIG.
27C, and peptides T-12, T-13, T-15, T-19, T-28 to
T-30, T-66, T-69, T-70 and T-576, as shown in FIG. 27D;
and

3) peptides T-67 and T-104 to T-119 and T-384, as
shown in FIG. 28A, and peptides T-71, T-613 to T-617,
T-662 to T-676 and T-730, as shown in FIG. 28B.

The peptides of group 1 represent portions of the
RSV F2 protein DP178/DP107-like region. The peptides of
group 2 represent portions of the RSV F1 protein
DP107-like region. The peptides of group 3 represent
portions of the RSV F1 protein DP178-like region.

Each peptide was tested at 2-fold serial
dilutions ranging from 100 µg/ml to approximately
100 ng/ml. For each of the assays, a well containing
no peptide was also used. The IC50 data for each
peptide represent the average of several experiments
conducted utilizing that peptide.
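
The dilution scheme described above can be sketched as a simple halving series. The number of dilution points is not stated in the text; eleven points are assumed here because ten successive halvings of 100 µg/ml give roughly 100 ng/ml.

```python
def twofold_series(top_concentration, n_points):
    """Return n_points concentrations, each half of the one before it."""
    return [top_concentration / 2 ** i for i in range(n_points)]


# Assumed: 11 points starting at 100 µg/ml (point count is not given in the text).
series_ug_per_ml = twofold_series(100.0, 11)

for c in series_ug_per_ml:
    print(f"{c:10.4f} µg/ml")

# The lowest concentration, expressed in ng/ml.
lowest_ng_per_ml = series_ug_per_ml[-1] * 1000.0
print(f"lowest point: {lowest_ng_per_ml:.1f} ng/ml")
```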

17.2 RESULTS 
The data summarized in FIGS. 27A-D and 28A-B
represent antiviral and structural information
obtained from peptides derived from the RSV F2
DP178/DP107-like F2 region (FIG. 27A-B), the RSV F1
DP107-like region (FIG. 27C-D) and the RSV DP178-like
F2 region (FIG. 28A-B).

As shown in FIGS. 27A-D, a number of the RSV
DP178/DP107-like peptides exhibited a detectable level
of antiviral activity. Peptides from the RSV
DP178/DP107-like F2 region (FIG. 27A-B), for example,
the purified peptides T-142 to T-145 and T-334, exhibited
detectable levels of antiviral activity, as evidenced
by their IC50 values. Further, a number of RSV F1
DP107-like peptides (FIG. 27C-D) exhibited a sizable
level of antiviral activity as purified peptides,
including, for example, peptides T-124 to T-127,
T-131, T-135 and T-137 to T-139, as demonstrated by
their low IC50 values. In addition, CD analysis (FIG.
27A, 27C) reveals that many of the peptides exhibit
some detectable level of helical structure.

The results summarized in FIG. 28A-B demonstrate
that a number of DP178-like purified peptides exhibit
a range of potent anti-viral activity. These peptides
include, for example, T-67, T-104, T-105 and T-107 to
T-119, as listed in FIG. 28A, and T-665 to T-669 and
T-671 to T-673, as listed in FIG. 28B. In addition,
some of the DP178-like peptides exhibited some level
of helicity.

Thus, the computer assisted searches described,
hereinabove, successfully identified viral peptide
domains that represent highly promising anti-RSV
antiviral compounds.

18. EXAMPLE: POTENTIAL HUMAN PARAINFLUENZA VIRUS 
TYPE 3 DP178/DP107 ANALOGS: CD AND 
ANTIVIRAL CHARACTERIZATION 

In the Example presented herein, human 
parainfluenza virus type 3 (HPIV3) peptides identified 
by utilizing the computer-assisted search motifs
described in the Examples presented in Sections 9 and 
15, above, were tested for anti-HPIV3 activity. 
Additionally, circular dichroism (CD) structural 
analyses were conducted on the peptides, as discussed 
below. It is demonstrated that several of the 
identified peptides exhibit potent antiviral 
capability. Additionally, it is shown that several of 
these peptides exhibit a substantial helical 
character. 



18.1 MATERIALS AND METHODS 
Structural analyses: Structural analyses
consisted of circular dichroism (CD) studies. The CD
spectra were measured in a 10 mM sodium phosphate,
150 mM sodium chloride, pH 7.0, buffer at approximately
10 mM concentrations, using a 1 cm pathlength cell on a
Jobin/Yvon Autodichrograph Mark V CD
spectrophotometer. Peptide concentrations were
determined from A280 using Edelhoch's method (1967,
Biochemistry 6:1948).
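
For illustration, a measured CD signal is conventionally normalized to a mean residue ellipticity before helical content is judged. The sketch below uses the standard conversion from millidegrees; the helix reference value of -33,000 deg cm^2 dmol^-1 at 222 nm is only one of several values in use and is an assumption here, as are the example readings, which are hypothetical and not taken from the figures.

```python
def mean_residue_ellipticity(theta_mdeg, conc_molar, pathlength_cm, n_residues):
    """Convert an observed ellipticity (mdeg) to deg cm^2 dmol^-1 per residue."""
    return theta_mdeg / (10.0 * conc_molar * pathlength_cm * n_residues)


def fraction_helix(mre_222, helix_reference=-33000.0):
    """Rough helical fraction from the 222 nm signal (reference value assumed)."""
    return max(0.0, min(1.0, mre_222 / helix_reference))


# Hypothetical reading: -20 mdeg at 222 nm, 10 micromolar peptide,
# 1 cm pathlength cell, 36-residue peptide.
mre = mean_residue_ellipticity(-20.0, 10e-6, 1.0, 36)
print(round(mre, 1), round(fraction_helix(mre), 3))
```

The clamp to the interval from 0 to 1 simply prevents noise or a positive signal from being reported as a negative or greater-than-unity helical fraction.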

Anti-HPIV3 antiviral activity assays: The assay
utilized herein tested the ability of the peptides to
disrupt the ability of Hep2 cells chronically infected
with HPIV3 to fuse and cause syncytial formation on a
monolayer of an uninfected line of CV-1W cells. The
lower the observed level of fusion,
the greater the antiviral activity of the peptide.

Uninfected confluent monolayers of CV-1W cells
were grown in microtiter wells in 3% EMEM (Eagle
Minimum Essential Medium w/o L-glutamine [Bio
Whittaker Cat. No. 12-125F], with fetal bovine serum
[FBS; which had been heat inactivated for 30 minutes
at 56°C; Bio Whittaker Cat. No. 14-501F] supplemented
at 3%, antibiotics/antimycotics (Gibco BRL Life
Technologies Cat. No. 15040-017) added at 1%, and
glutamine added at 1%).

To prepare Hep2 cells for addition to uninfected 
cells, cultures of chronically infected Hep2 cells 
were washed with DPBS (Dulbecco's Phosphate Buffered 
Saline w/o calcium or magnesium; Bio Whittaker Cat. 
No. 17-512F) and cell monolayers were removed with 
Versene (1:5000; Gibco Life Technologies Cat. No. 
15040-017). The cells were spun 10 minutes and
resuspended in 3% FBS. Cell counts were performed
using a hemacytometer. Persistent cells were added to
the uninfected CV-1W cells.

The antiviral assay was conducted by, first,
removing all media from the wells containing
uninfected CV-1W cells, then adding peptides (at the
dilutions described below) in 3% EMEM, and 500
chronically HPIV3-infected Hep2 cells per well. Wells
were then incubated at 37°C for 24 hours.

On day 2, after cells in control wells were
checked for fusion centers, media was removed from the
wells, followed by addition, to each well, of
approximately 50 µl of 0.25% Crystal Violet stain in
methanol. Wells were rinsed immediately, to remove
excess stain, and were then allowed to dry. The number
of syncytia per well was then counted, using a
dissecting microscope.

Alternatively, instead of Crystal Violet
analysis, cells were assayed with XTT, as described,
above, in Section 17.1.

Peptides: The peptides characterized in the
study presented herein were:

1) Peptides 157 to 188, as shown in FIG. 29A, and
peptides T-38 to T-40, T-42 to T-46 and T-582, as
shown in FIG. 29B. These peptides are derived
from the DP107 region of the HPIV3 F1 fusion
protein (represented by HPF3 107, as shown in
FIG. 29A); and

2) Peptides 189 to 210, as shown in FIG. 30A, and
T-269, T-626, T-383 and T-577 to T-579, as shown in
FIG. 30B. These peptides are primarily derived
from the DP178 region of the HPIV3 F1 fusion
protein (represented by HPF3 178, as shown in
FIG. 30A). Peptide T-626 contains two mutated
amino acid residues (represented by a shaded
background). Additionally, peptide T-577
represents F1 amino acids 65-100, T-578
represents F1 amino acids 207-242 and T-579
represents F1 amino acids 273-309.

Each peptide was tested at 2-fold serial
dilutions ranging from 500 µg/ml to approximately
500 ng/ml. For each of the assays, a well containing
no peptide was also used.

18.2 RESULTS
The data summarized in FIGS. 29A-C and 30A-B
represent antiviral and structural information
obtained from peptides derived from the HPIV3 fusion
protein DP107-like region (FIG. 29A-C) and the HPIV3
fusion protein DP178-like region (FIG. 30A-B).

As shown in FIG. 29A-B, a number of the HPIV3
DP107-like peptides exhibited potent levels of
antiviral activity. These peptides include, for
example, peptides T-40, T-172 to T-175, T-178, T-184
and T-185.

CD analysis reveals that a number of the peptides
exhibit detectable to substantial levels of helical
structure. The CD spectra for one of the peptides,
184, which exhibits substantial helicity, are summarized
in FIG. 29C.

The results summarized in FIG. 30A-B demonstrate
that a number of the DP178-like peptides tested
exhibit a range of anti-viral activity. These
peptides include, for example, peptides 194 to 211, as
evidenced by their low IC50 values. In fact, peptides
201 to 205 exhibit IC50 values in the nanogram/ml
range. In addition, many of the DP178-like peptides
exhibited some level of helicity.

Thus, the computer assisted searches described,
hereinabove, have successfully identified viral
peptide domains that represent highly promising
anti-HPIV3 antiviral compounds.

19. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP178/DP107 ANALOGS IN SIMIAN 
IMMUNODEFICIENCY VIRUS 

FIG. 31 represents search results for SIV isolate
MM251 (PC/Gene® protein sequence PENV_SIVM2). Both
107x178x4 and ALLMOTI5 search motifs identified two
regions with similarities to DP107 and/or DP178.

The peptide regions found by 107x178x4 were
located at amino acid residues 156-215 and 277-289.
The peptide regions found by ALLMOTI5 were located at
amino acid residues 156-219 and 245-286. Both motifs,
therefore, identify similar regions.

Interestingly, the first SIV peptide region
(i.e., from amino acid residue 156 to approximately
amino acid residue 219) correlates with a DP107
region, while the second region identified (i.e., from
approximately amino acid residue 245 to approximately
amino acid residue 289) correlates with the DP178
region of HIV. In fact, an alignment of SIV isolate
MM251 and HIV isolate BRU, followed by a selection of
the best peptide matches for HIV DP107 and DP178,
reveals that the best matches are found within the
peptide regions identified by the 107x178x4 and
ALLMOTI5 search motifs.

It should be noted that a potential coiled-coil
region at amino acid residues 242-282 is predicted by
the Lupas program. This is similar to the observation
in HIV in which the coiled-coil is predicted by the
Lupas program to be in the DP178 rather than in the
DP107 region. It is possible, therefore, that SIV may
be similar to HIV in that it may contain a coiled-coil
structure in the DP107 region, despite such a
structure being missed by the Lupas algorithm.
Likewise, it may be that the region corresponding to a
DP178 analog in SIV may exhibit an undefined
structure, despite the Lupas program's prediction of a
coiled-coil structure.

20. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP178/DP107 ANALOGS IN EPSTEIN-BARR
VIRUS

The results presented herein describe the
identification of DP178/DP107 analogs within two
different Epstein-Barr Virus proteins. Epstein-Barr
is a human herpes virus which is the causative agent
of, for example, infectious mononucleosis (IM), and is
also associated with nasopharyngeal carcinomas (NPC),
Burkitt's lymphoma and other diseases. The virus
predominantly exists in the latent form and is
activated by a variety of stimuli.

FIG. 32 depicts the search motif results for the
Epstein-Barr Virus (Strain B95-8; PC/Gene® protein
sequence PVGLB_EBV) glycoprotein gp110 precursor
(gp115). The 107x178x4 motif identified two regions
of interest, namely the regions covered by amino acid
residues 95-122 and 631-658. One PZIP region was
identified at amino acid residues 732-752, which is most
likely a cytoplasmic region of the protein. The Lupas
algorithm predicts a coiled-coil structure for amino
acids 657-684. No ALLMOTI5 regions were identified.

FIG. 33 depicts the search motif results for the
Zebra (or EB1) trans-activator protein (BZLF1) of the
above-identified Epstein-Barr virus. This protein is
a transcription factor which represents the primary
mediator of viral reactivation. It is a member of the
b-ZIP family of transcription factors and shares
significant homology with the basic DNA-binding and
dimerization domains of the cellular oncogenes c-fos
and C/EBP. The Zebra protein functions as a
homodimer.

Search results demonstrate that the Zebra protein
exhibits a single region which is predicted to be
either of DP107 or DP178 similarity, and is found
between the known DNA binding and dimerization regions
of the protein. Specifically, this region is located
at amino acid residues 193-220, as shown in FIG. 33.
The Lupas program predicted no coiled-coil regions.

21. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP178/DP107 ANALOGS IN MEASLES VIRUS

FIG. 34 illustrates the motif search results for
the fusion protein F1 of measles virus, strain
Edmonston (PC/Gene® protein sequence PVGLF_MEASE),
successfully identifying DP178/DP107 analogs.

The 107x178x4 motif identifies a single region at
amino acid residues 228-262. The ALLMOTI5 search
motif identifies three regions, including amino acid
residues 116-184, 228-269 and 452-500. Three regions
containing proline residues followed by a leucine
zipper-like sequence were found beginning at proline
residues 214, 286 and 451.
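
The general idea of locating a proline followed by a leucine zipper-like sequence can be sketched with a regular expression. The pattern below is purely hypothetical: it is not the PZIP, P12LZIPC or P23LZIPC motif as defined elsewhere in this specification, and the spacer length, the requirement of four leucines at seven-residue intervals, and the toy sequence are all assumptions chosen for illustration.

```python
import re

# Hypothetical pattern: a proline, a short spacer of up to 12 residues, then
# four leucines spaced seven residues apart. The lookahead allows every
# proline to be tested as a possible start, so overlapping hits are kept.
PROLINE_ZIPPER = re.compile(r"(?=(P.{0,12}?L.{6}L.{6}L.{6}L))")


def find_proline_zippers(sequence):
    """Return 1-based inclusive (start, end) positions of pattern matches."""
    hits = []
    for match in PROLINE_ZIPPER.finditer(sequence.upper()):
        start = match.start() + 1
        end = match.start() + len(match.group(1))
        hits.append((start, end))
    return hits


# Toy sequence (not a viral protein) with one proline ahead of a heptad repeat.
toy_sequence = "GGPAA" + "LAAAAAA" * 3 + "LGG"
print(find_proline_zippers(toy_sequence))
```

Each reported start position is that of the proline, mirroring the way the regions above are described as beginning at particular proline residues.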

The Lupas program identified two regions it 
predicted had potential for coiled-coil structure, 
which include amino acid residues 141-172 and 444-483. 

22. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP178/DP107 ANALOGS IN HEPATITIS B 
VIRUS 

FIG. 35 depicts the results of a PZIP motif
search conducted on the Hepatitis B virus subtype AYW.
Two regions of interest within the major surface
antigen precursor S protein were identified. The
first lies just C-terminal to the proposed fusion
peptide of the major surface antigen (Hbs) which is
found at amino acid residues 174-191. The second
region is located at amino acid residues 233-267. The
Lupas program predicts no coiled-coil repeat regions.

In order to test the potential anti-HBV antiviral
activity of these DP178/DP107 analog regions, peptides
derived from the area around the analog regions are
synthesized, as shown in FIG. 52A-B. These peptides
represent one amino acid peptide "walks" through the
putative DP178/DP107 analog regions. The peptides are
synthesized according to standard Fmoc chemistry on
Rink amide MBHA resins to provide for carboxy terminal
blockade (Chang, C.D. and Meinhofer, J., 1978, Int. J.
Pept. Protein Res. 11:246-249; Fields, G.B. and Noble,
R.L., 1990, Int. J. Pept. Protein Res. 35:161-214).
Following complete synthesis, the peptide amino-terminus
is blocked through automated acetylation and
the peptide is cleaved with trifluoroacetic acid (TFA)
and the appropriate scavengers (King, D.S. et al.,
1990, Int. J. Pept. Res. 36:255-266). After cleavage,
the peptide is precipitated with ether and dried under
vacuum for 24 hours.
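
The one amino acid peptide "walk" described above can be sketched as a sliding window across a region of a protein sequence. The function name, the window length and the toy sequence are hypothetical; the actual peptides are those shown in the referenced figures.

```python
def peptide_walk(sequence, start, end, length, step=1):
    """List (first_residue, peptide) pairs for a sliding window.

    start and end are 1-based, inclusive residue numbers bounding the
    region; each peptide is shifted by `step` residues from the previous one.
    """
    region = sequence[start - 1:end]
    return [(start + i, region[i:i + length])
            for i in range(0, len(region) - length + 1, step)]


# Toy ten-residue sequence; walk a 5-residue window through residues 2-8.
toy_sequence = "ACDEFGHIKL"
for first_residue, peptide in peptide_walk(toy_sequence, 2, 8, 5):
    print(first_residue, peptide)
```

With a step of one residue, consecutive peptides overlap in all but one position, which is what allows the activity of a region to be mapped at single-residue resolution.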

The anti-HBV activity of the peptides is tested
by utilizing standard assays to determine the test
peptide concentration required to cause an acceptable
(e.g., 90%) decrease in the amount of viral progeny
formed by cells exposed to an HBV viral inoculum.
Candidate antiviral peptides are further characterized
in model systems such as woodchuck tissue culture and
animal systems, prior to testing on humans.

23. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP178/DP107 ANALOGS IN SIMIAN
MASON-PFIZER MONKEY VIRUS

The results depicted herein illustrate the
results of motif searches conducted on the Simian
Mason-Pfizer monkey virus. The motifs reveal
DP178/DP107 analogs within the envelope (TM) protein
GP20, as shown in FIG. 36.

The 107x178x4 motif identifies a region at amino
acid residues 422-470. The ALLMOTI5 motif finds a region at
amino acid residues 408-474. The Lupas program
predicted a coiled-coil structure at amino acids
424-459.

24. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP178/DP107 ANALOGS IN BACTERIAL
PROTEINS

The results presented herein demonstrate the 
identification of DP178/DP107 analogs corresponding 
to sequences present in proteins of a variety of 
bacterial species.

FIG. 37 depicts the search motif results for the
Pseudomonas aeruginosa fimbrial protein (Pilin). Two
regions were identified by motifs 107x178x4 and
ALLMOTI5. The regions located at amino acid residues
30-67 and 80-144 were identified by the 107x178x4
motif. The regions at amino acid residues 30-68 and
80-125 were identified by the ALLMOTI5 motif.

FIG. 38 depicts the search motif results for the
Pseudomonas gonorrhoeae fimbrial protein (Pilin). A
single region was identified by both the 107x178x4 and
the ALLMOTI5 motifs. The region located at amino acid
residues 66-97 was identified by the 107x178x4 motif.
The region located at amino acid residues 66-125 was
identified by the ALLMOTI5 search motif. No coiled-
coil regions were predicted by the Lupas program.

FIG. 39 depicts the search motif results for the
Hemophilus Influenza fimbrial protein (Pilin). A
single region was identified by both the 107x178x4 and
the ALLMOTI5 motifs. The region located at amino acid
residues 102-129 was identified by the 107x178x4
motif. The region located at amino acid residues
102-148 was identified by the ALLMOTI5 search motif. No
coiled-coil regions were predicted by the Lupas
program.

FIG. 40 depicts the search motif results for the
Staphylococcus aureus toxic shock syndrome Hemophilus
Influenza fimbrial protein (Pilin). A single region
was identified by both the 107x178x4 and the ALLMOTI5
motifs. The region located at amino acid residues
102-129 was identified by the 107x178x4 motif. The
region located at amino acid residues 102-148 was
identified by the ALLMOTI5 search motif. No coiled-
coil regions were predicted by the Lupas program.

FIG. 41 summarizes the motif search results
conducted on the Staphylococcus aureus enterotoxin
Type E protein. These results demonstrate the
successful identification of DP178/DP107 analogs
corresponding to peptide sequences within this
protein, as described below.

The ALLMOTI5 motif identified a region at amino
acid residues 22-27. The 107x178x4 motif identified
two regions, with the first at amino acid residues
26-69 and the second at 88-115. A P12LZIPC motif search
identified two regions, at amino acid residues 163-181
and 230-250.

The Lupas program predicted a region with a high
propensity for coiling at amino acid residues 25-54.
This sequence is completely contained within the first
region identified by both ALLMOTI5 and 107x178x4
motifs.

FIG. 42 depicts the search motif results
conducted on a second Staphylococcus aureus toxin,
enterotoxin A. Two regions were identified by the
ALLMOTI5 motif, at amino acid residues 22-70 and amino
acid residues 164-205. The 107x178x4 motif found two
regions, the first at amino acid residues 26-69 and
the second at amino acid residues 165-192. A P23LZIPC
motif search revealed a region at amino acid residues
216-250. No coiled-coil regions were predicted by the
Lupas program.

FIG. 43 shows the motif search results conducted
on the E. coli heat labile enterotoxin A protein,
demonstrating the identification of DP178/DP107
analogs corresponding to peptides located within this
protein. Two regions were identified by the ALLMOTI5
motif, with the first residing at amino acid residues
55-115, and the second residing at amino acid residues
216-254. The 107x178x4 motif identified a single
region at amino acid residues 78-105. No coiled-coil
regions were predicted by the Lupas program.

25. EXAMPLE: COMPUTER-ASSISTED IDENTIFICATION OF
DP178/DP107 ANALOGS WITHIN VARIOUS
HUMAN PROTEINS

The results presented herein demonstrate the
identification of DP178/DP107 analogs corresponding to
peptide sequences present within several different
human proteins.

FIG. 44 illustrates the search motif results
conducted on the human c-fos oncoprotein. The
ALLMOTI5 motif identified a single region at amino
acid residues 155-193. The 107x178x4 motif identified
one region at amino acid residues 162-193. The Lupas
program predicted a region at amino acid residues
148-201 to have coiled-coil structure.

FIG. 45 illustrates the search motif results
conducted on the human lupus KU autoantigen protein
P70. The ALLMOTI5 motif identified a single region at
amino acid residues 229-280. The 107x178x4 motif
identified one region at amino acid residues 235-292.
The Lupas program predicted a region at amino acid
residues 232-267 to have coiled-coil structure.

FIG. 46 illustrates the search motif results
conducted on the human zinc finger protein 10. The
ALLMOTI5 motif identified a single region at amino
acid residues 29-81. The 107x178x4 motif identified
one region at amino acid residues 29-56. A P23LZIPC
motif search found a single region at amino acid
residues 420-457. The Lupas program predicted no
coiled-coil regions.

26. EXAMPLE: POTENTIAL MEASLES VIRUS DP178/DP107
ANALOGS: CD AND ANTIVIRAL
CHARACTERIZATION

In the Example presented herein, measles (MeV)
virus DP178-like peptides identified by utilizing the
computer-assisted search motifs described in the
Examples presented in Sections 9 and 21, above, are
tested for anti-MeV activity. Additionally, circular
dichroism (CD) structural analyses are conducted on
the peptides, as discussed below. It is demonstrated
that several of the identified peptides exhibit potent
antiviral capability. Additionally, it is shown that
none of these peptides exhibit a substantial
helical character.

26.1 MATERIALS AND METHODS 
Structural analyses: The CD spectra were
measured in a 10 mM sodium phosphate, 150 mM sodium
chloride, pH 7.0, buffer at approximately 10 mM
concentrations, using a 1 cm pathlength cell on a
Jobin/Yvon Autodichrograph Mark V CD
spectrophotometer. Peptide concentrations were
determined from A280 using Edelhoch's method (1967,
Biochemistry 6:1948).

Anti-MeV antiviral activity syncytial reduction
assay: The assay utilized herein tested the ability
of the peptides to disrupt the ability of Vero cells
acutely infected with MeV (i.e., cells which are
infected with a multiplicity of infection of 2-3) to
fuse and cause syncytial formation on a monolayer of
an uninfected line of Vero cells. The more potent the
peptide, the lower the observed level of fusion and the
greater the antiviral activity of the peptide.

Uninfected confluent monolayers of Vero cells
were grown in microtiter wells in 10% FBS EMEM (Eagle
Minimum Essential Medium w/o L-glutamine [Bio
Whittaker Cat. No. 12-125F], with fetal bovine serum
[FBS; which had been heat inactivated for 30 minutes
at 56°C; Bio Whittaker Cat. No. 14-501F] supplemented
at 10%, antibiotics/antimycotics (Bio Whittaker Cat.
No. 17-602E) added at 1%, and glutamine added at 1%).

To prepare acutely infected Vero cells for
addition to the uninfected cells, cultures of acutely
infected Vero cells were washed twice with HBSS (Bio
Whittaker Cat. No. 10-543F) and cell monolayers were
removed with trypsin (Bio Whittaker Cat. No. 17-161E).
Once cells detached, media was added, any remaining
clumps of cells were dispersed, and hemacytometer cell
counts were performed.

The antiviral assay was conducted by, first,
removing all media from the wells containing
uninfected Vero cells, then adding peptides (at the
dilutions described below) in 10% FBS EMEM, and 50-100
acutely MeV-infected Vero cells per well. Wells were
then incubated at 37°C for a maximum of 18 hours.

On day 2, after cells in control wells were
checked for fusion centers, media was removed from the
wells, followed by addition, to each well, of
approximately 50 µl of 0.25% Crystal Violet stain in
methanol. Wells were rinsed twice with water
immediately, to remove excess stain, and were then
allowed to dry. The number of syncytia per well was
then counted, using a dissecting microscope.

Anti-MeV antiviral activity plaque reduction
assay: The assay utilized herein tested the ability
of the peptides to disrupt the ability of MeV to
infect permissive, uninfected Vero cells, leading to
the infected cells' fusing with uninfected cells to
produce syncytia. The lower the observed level of
syncytial formation, the greater the antiviral
activity of the peptide.

Monolayers of uninfected Vero cells are grown as
described above.

The antiviral assay was conducted by, first,
removing all media from the wells containing
uninfected Vero cells, then adding peptides (at the
dilutions described below) in 10% FBS EMEM, and MeV
stock virus at a final concentration of 30 plaque
forming units (PFU) per well. Wells were then
incubated at 37°C for a minimum of 36 hours and a
maximum of 48 hours.

On day 2, after cells in control wells were
checked for fusion centers, media was removed from the
wells, followed by addition, to each well, of
approximately 50 µl of 0.25% Crystal Violet stain in
methanol. Wells were rinsed twice with water
immediately, to remove excess stain, and were then
allowed to dry. The number of syncytia per well was
then counted, using a dissecting microscope.

Peptides: The peptides characterized in the
study presented herein were peptides T-252A0 to
T-256A0, T-257B1/C1, and T-258B1 to T-265B0, and T-266A0
to T-268A0, as shown in FIG. 47. These peptides
represent a walk through the DP178-like region of the
MeV fusion protein.

Each peptide was tested at 2-fold serial
dilutions ranging from 100 µg/ml to approximately
100 ng/ml. For each of the assays, a well containing
no peptide was also used.

26.2 RESULTS
The data summarized in FIG. 47 represents
antiviral and structural information obtained via
"peptide walks" through the DP178-like region of the
MeV fusion protein.

As shown in FIG. 47, the MeV DP178-like peptides
exhibited a range of antiviral activity as crude
peptides. Several of these peptides were chosen for
purification and further antiviral characterization.
The IC50 values for such peptides were determined, as
shown in FIG. 47, and ranged from 1.35 µg/ml
(T-257B1/C1) to 0.072 µg/ml (T-265B1). None of the
DP178-like peptides showed, by CD analysis, a detectable
level of helicity.
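
The text does not state how the IC50 values were computed from the dilution data. One simple approach, assumed here for illustration only, is to interpolate on a logarithmic concentration scale between the two tested concentrations that bracket 50% inhibition. The function name and the example data are hypothetical.

```python
import math


def ic50_log_interpolation(concentrations, inhibitions):
    """Estimate the concentration giving 50% inhibition.

    Interpolates linearly in log10(concentration) between the two adjacent
    tested concentrations whose percent inhibition values bracket 50.
    Returns None when 50% inhibition is not bracketed by the data.
    """
    pairs = sorted(zip(concentrations, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(pairs, pairs[1:]):
        if i_lo < 50.0 <= i_hi:
            fraction = (50.0 - i_lo) / (i_hi - i_lo)
            log_c = math.log10(c_lo) + fraction * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None


# Hypothetical data: 20% inhibition at 1 µg/ml and 80% at 100 µg/ml.
print(ic50_log_interpolation([1.0, 100.0], [20.0, 80.0]))
```

A fitted dose-response curve would normally be preferred when enough points are available; the interpolation shown is only the simplest estimate consistent with a 2-fold dilution series.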

Thus, the computer assisted searches described,
hereinabove, as in, for example, the Example presented
in Section 9, successfully identified
viral peptide domains that represent highly promising
anti-MeV antiviral compounds.



27. EXAMPLE: POTENTIAL SIV DP178/DP107 ANALOGS: 
ANTIVIRAL CHARACTERIZATION 

In the Example presented herein, simian
immunodeficiency virus (SIV) DP178-like peptides
identified by utilizing the computer-assisted search
motifs described in the Examples presented in Sections
9, 12 and 19, above, were tested for anti-SIV
activity. It is demonstrated that several of the
identified peptides exhibit potent antiviral
capability.

27.1 MATERIALS AND METHODS
Anti-SIV antiviral assays: The assays utilized
herein were as reported in Langlois et al. (Langlois,
A.J. et al., 1991, AIDS Research and Human
Retroviruses 7:713-720).

Peptides: The peptides characterized in the
study presented herein were peptides T-391 to T-400,
as shown in FIG. 48. These peptides represent a walk
through the DP178-like region of the SIV TM protein.

Each peptide was tested at 2-fold serial
dilutions ranging from 100 µg/ml to approximately
100 ng/ml. For each of the assays, a well containing
no peptide was also used.

27.2 RESULTS
The data summarized in FIG. 48 represents
antiviral information obtained via "peptide walks"
through the DP178-like region of the SIV TM protein.
As shown in FIG. 48, peptides T-391 to T-400 were
tested and exhibited a potent antiviral activity as
crude peptides.

Thus, the computer assisted searches described,
hereinabove, as in, for example, the Example presented
in Section 9, successfully identified
viral peptide domains that represent highly promising
anti-SIV antiviral compounds.

28. EXAMPLE: ANTI-VIRAL ACTIVITY OF DP107 AND DP178
PEPTIDE TRUNCATIONS AND MUTATIONS

The Example presented in this Section represents
a study of the antiviral activity of DP107 and DP178
truncations and mutations. It is demonstrated that
several of these DP107 and DP178 modified peptides
exhibit substantial antiviral activity.

28.1 MATERIALS AND METHODS
Anti-HIV assays: The antiviral assays performed
were as those described, above, in Section 6.1.
Assays utilized HIV-1/IIIb and/or HIV-2 NIHZ isolates.
Purified peptides were used, unless otherwise noted in
FIGS. 49A-C.

Peptides: The peptides characterized in the
study presented herein were:

1) FIGS. 49A-C present peptides derived from
the region around and containing the DP178
region of the HIV-1 BRU isolate.
Specifically, this region spanned from gp41
amino acid residue 615 to amino acid residue
717. The peptides listed contain
truncations of this region and/or mutations
which vary from the DP178 amino
acid sequence. Further, certain of the
peptides have had amino- and/or carboxy-
terminal groups either added or removed,
as indicated in the figures; and

2) FIG. 50 presents peptides which represent
truncations of DP107 and/or the gp41 region
surrounding the DP107 amino acid sequence of
HIV-1 BRU isolate. Certain of the peptides
are unblocked or biotinylated, as indicated
in the figure.
Blocked peptides contained an acyl N-terminus and 
an amido C-terminus. 

28.2 RESULTS
Anti-HIV antiviral data were obtained with the
group 1 DP178-derived peptides listed in FIGS. 49A-C.
The results shown for the full-length, non-mutant
DP178 peptide (referred to in FIGS. 49A-C as T20)
are for 4 ng/ml.
In FIG. 49A, a number of the DP178 truncations
exhibited a high level of antiviral activity, as
evidenced by their low IC50 values. These include, for
example, test peptides T-50, T-624, T-636 to T-641,
T-645 to T-650, T-652 to T-654 and T-656. T-50
represents a test peptide which contains a point
mutation, as indicated by the residue's shaded
background. The HIV-1-derived test peptides exhibited
a distinct strain-specific antiviral activity, in that
none of the peptides tested on the HIV-2 NIHZ isolate
demonstrated appreciable anti-HIV-2 antiviral
activity.

Among the peptides listed in FIG. 49B are test
peptides representing the amino- (T-4) and carboxy-
(T-3) terminal halves of DP178. The amino-terminal
peptide was not active (IC50 > 400 µg/ml) whereas
the carboxy-terminal peptide showed potent antiviral
activity (IC50 = 3 µg/ml). A number of additional test
peptides also exhibited a high level of antiviral
activity. These included, for example, T-61/T-102,
T-217 to T-221, T-235, T-381, T-677, T-377, T-590,
T-378, T-591, T-271 to T-272, T-611, T-222 to T-223 and
T-60/T-224. Certain of the antiviral peptides contain
point mutations and/or amino acid residue additions
which vary from the DP178 amino acid sequence.

In FIG. 49C, point mutations and/or amino- and/or
carboxy-terminal modifications are introduced into the
DP178 amino acid sequence itself. As shown in the
figure, the majority of the test peptides listed
exhibit potent antiviral activity.

Truncations of the DP107 peptide (referred to in
FIG. 50 as T21) were also produced and tested, as shown
in FIG. 50. FIG. 50 also presents data concerning
blocked and unblocked peptides which contain
additional amino acid residues from the gp41 region in
which the DP107 sequence resides. Most of these
peptides showed antiviral activity, as evidenced by
their low IC50 values.

Thus, the results presented in this Section
demonstrate that not only do the full length DP107 and
DP178 peptides exhibit potent antiviral activity, but
truncations and/or mutant versions of these peptides
can also possess substantial antiviral character.

29. EXAMPLE: POTENTIAL EPSTEIN-BARR DP178/DP107
ANALOGS: ANTIVIRAL CHARACTERIZATION

In the Example presented herein, peptides derived
from the Epstein-Barr virus (EBV) DP178/DP107 analog
region of the Zebra protein identified above, in the
Example presented in Section 20, are described and
tested for anti-EBV activity. It is demonstrated that
among these peptides are ones which exhibit potential
antiviral activity.

29.1 MATERIALS AND METHODS

Electrophoretic Mobility Shift Assays (EMSA):
Briefly, an EBV Zebra protein was synthesized
utilizing SP6 RNA polymerase in vitro transcription
and wheat germ in vitro translation systems (Promega
Corporation recommendations; Butler, E.T. and
Chamberlain, M.J., 1984, J. Biol. Chem. 257:5772;
Pelham, H.R.B. and Jackson, R.J., 1976, Eur. J.
Biochem. 67:247). The in vitro translated Zebra
protein was then preincubated with increasing amounts
of peptide up to 250 ng/ml prior to the addition of
10,000 to 20,000 c.p.m. of a 32P-labeled Zebra response
element DNA fragment. After a 20 minute incubation in
the presence of the response element, the reaction was
analyzed on a 4% non-denaturing polyacrylamide gel,
followed by autoradiography, utilizing standard
gel-shift procedures. The ability of a test peptide to
prevent Zebra homodimer DNA binding was assayed by the
peptide's ability to abolish the response element gel
migration retardation characteristic of a
protein-bound nucleic acid molecule.

Peptides: The peptides characterized in this
study represent peptide walks through the region
containing, and flanked on both sides by, the
DP178/DP107 analog region identified in the Example
presented in Section 20, above, and shown in
FIG. 33. Specifically, the peptide walks covered the
region from amino acid residue 173 to amino acid
residue 246 of the EBV Zebra protein.
Each of the tested peptides was analyzed at a
range of concentrations, with 150 ng/ml being the
lowest concentration at which any of the peptides
exerted an inhibitory effect.
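
The idea of a peptide walk across residues 173 to 246 can be sketched as a list of overlapping residue windows. In the sketch below, the step of one residue follows the "one amino acid walks" mentioned in the Results, whereas the window length of 35 residues is purely a hypothetical value chosen for illustration, as are the function and variable names. The peptides actually tested, and their lengths, are those listed in the figures.

```python
def peptide_walk(first_residue, last_residue, window, step=1):
    """Return inclusive (start, end) residue numbers for each walk peptide."""
    if window < 1 or step < 1:
        raise ValueError("window and step must be positive")
    ranges = []
    start = first_residue
    # Stop as soon as a full-length window would run past the region.
    while start + window - 1 <= last_residue:
        ranges.append((start, start + window - 1))
        start += step
    return ranges


# Region taken from the text: EBV Zebra protein residues 173 to 246.
# window=35 is a hypothetical peptide length, not a value from the source.
walk = peptide_walk(173, 246, window=35)
print(len(walk), walk[0], walk[-1])
```

With these illustrative settings the 74-residue region yields 40 windows, the first spanning residues 173 to 207 and the last spanning residues 212 to 246.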



29.2 RESULTS

The EBV Zebra protein transcription factor
contains a DP178/DP107 analog region, as demonstrated
in the Example presented above in Section 20. This
protein appears to be the primary factor responsible
for the reactivation capability of the virus. A
method by which the DNA-binding function of the Zebra
protein may be abolished may, therefore, represent an
effective antiviral technique. In order to identify
potential anti-EBV DP178/DP107 peptides, therefore,
peptides derived from the region identified in Section
20, above, were tested for their ability to inhibit
Zebra protein DNA binding.
The test peptides' ability to inhibit Zebra
protein DNA binding was assayed via the EMSA assays
described above in Section 29.1. The data
summarized in FIGS. 51A-B present the results of EMSA
assays of the listed EBV test peptides. These
peptides represent one amino acid "walks" through the
region containing, and flanked on both sides by, the
DP178/DP107 analog region identified in the Example
presented in Section 20, above, and shown in
FIG. 33. As shown in FIGS. 51A-B, the region from
which these peptides are derived lies from EBV Zebra
protein amino acid residue 173 to 246. A number of
the test peptides which were assayed exhibited an
ability to inhibit Zebra protein homodimer DNA
binding, including peptides 439, 441, 444 and 445.

Those peptides which exhibit an ability to
inhibit Zebra protein DNA binding represent potential
anti-EBV antiviral compounds whose ability to inhibit
EBV infection can be further characterized.

30. EXAMPLE: IDENTIFICATION OF RSV DP107/DP178
ANALOGS WITH REDUCED BINDING
AFFINITY

In the example presented herein, peptides derived
from the RSV DP178 analog T112 are described and
tested for binding affinity to the DP107-like domain
of the RSV F1 protein. Particular peptides are
identified that have a reduced binding affinity for
their DP107-like target, and key amino acid residues
are identified that confer high binding affinity to the
native peptide (i.e., to T112). Such peptides are
useful, e.g., in screening assays such as those
described above in Section 5.6.1 to identify compounds
which inhibit or disrupt the interaction between DP107
and DP178, and in providing guidance for generation of
additional peptides exhibiting reduced binding
affinity.



30.1 MATERIALS AND METHODS
A maltose binding fusion protein of the RSV F1
protein (MF5.1) was constructed using methods similar
to those described in Section 8.1.2, supra, for
construction of the M41 fusion protein. Specifically,
the DNA sequence corresponding to amino acid residues
142-302 of the RSV F1 protein was amplified by PCR and
cloned into the Xmn I site of the expression vector
pMal-p2 (New England Biolabs) to give MF5.1. These
amino acid residues correspond to the extracellular
domain of the RSV F1 protein including its DP107
region but excluding the DP178 region.

The peptides characterized in the study presented
herein were: T112, T800, T801, T802, T803, T804,
T805, T806, T807, T808, T809, T810, T811, T1669,
T1670, T1671, T1672, T1673, T1680, T1681, T1682, T1683
and T1684, as shown in FIG. 53. T112 represents the
DP178-like region of the RSV F1 protein. The other
peptides characterized are modified DP178 peptides
derived from T112.

Cell fusion assays were performed with each of
the peptides as described in Section 17 above. The
binding affinity of each peptide was also measured in
a competitive binding assay described in Section 5.6.1
above, wherein the concentration of each peptide
necessary to bind to the MF5.1 fusion protein (i.e.,
the B50 value), and thereby disrupt binding of
biotin-labeled T112 (T888) to the fusion protein, was
measured.

30.2 RESULTS
T112 is a 35 amino acid residue peptide that
corresponds to amino acid residues 482-516 of the RSV
F1 protein and has the following amino acid sequence:

VFPSDEFDASISQVNEKINQSLAFIRKSDELLHNV

The peptide represents the DP178-like region of the
RSV F1 protein and has substantial antiviral activity
against RSV, as discussed in Section 17.2 above and
shown in FIG. 28A.

T112 analogs were generated according to at least
three different strategies to generate peptides based
on T112 that would still bind to the DP107-like domain
of the RSV F1 protein but with a lower binding
affinity. First, a truncated peptide was generated,
reducing the length of the peptide from 35 to 28 amino
acid residues. Specifically, the truncated peptide,
which is referred to herein as T67, had the amino acid
sequence:

DEFDASISQVNEKINQSLAFIRKSDELL

corresponding to amino acid residues 486-513 of the F1
fusion protein. The binding affinity of the peptide
to the DP107-like domain of the F1 protein was determined
according to the methods described in Section 5.6.1
above. The truncated peptide had a binding affinity
(5 nM) that was five times lower than that of the full
length T112 peptide (2 nM).
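
The relationship between T112 and the truncated peptide can be checked with a short sketch. It assumes that T67 is the 28-residue stretch of T112 beginning at F1 residue 486 and therefore ending at residue 513; the helper name and constants are hypothetical and are introduced only for this illustration.

```python
T112 = "VFPSDEFDASISQVNEKINQSLAFIRKSDELLHNV"
T112_FIRST_RESIDUE = 482  # F1 protein numbering of the first T112 residue


def subpeptide(sequence, first_residue, start, end):
    """Slice a peptide using inclusive protein residue numbers."""
    return sequence[start - first_residue:end - first_residue + 1]


# Assumed span of the truncated peptide: F1 residues 486 to 513.
t67 = subpeptide(T112, T112_FIRST_RESIDUE, 486, 513)
print(len(T112), len(t67), t67)
```

The slice reproduces the 28-residue sequence given above, which supports reading the truncation as removal of four residues from the amino terminus and three from the carboxy terminus of T112.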

As part of a second strategy, the peptides
identified as T800 through T811 in FIG. 53 were
synthesized to identify particular amino acids in T112
that contribute to a larger part of that peptide's
binding affinity. As a whole, these alanine
substitutions represent an "alanine-scanning" type
walk across the sequence of T112.
Each of the peptides synthesized had a change of
three consecutive amino acid residues in the T112
sequence to three alanine residues. Each peptide was
tested for its ability to inhibit the binding of the
native peptide (i.e., of T112) in a competitive
binding assay as described in Section 5.6.1 above.
The results are also shown in FIG. 53. In particular,
the peptides T802, T804, T807 and T810 had
significantly reduced affinity for the DP107-like
target, suggesting that the regions containing amino
acid residues 488-490, 494-496, 503-505 and 512-514 of
the RSV F1 protein (amino acid residues 7-9, 13-15,
22-24 and 31-33, respectively, of T112) contribute
significantly to the high binding affinity of T112 for
its DP107-like target in the RSV F1 protein.
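
The triple-alanine walk described above can be sketched as follows. The sketch assumes consecutive, non-overlapping windows of three residues starting at the first residue of T112, which is inferred from the residue ranges quoted in the text rather than stated in it. The function name is hypothetical, and the final window in this sketch covers only two residues because T112 has 35 residues. The peptides actually synthesized are those shown in FIG. 53.

```python
T112 = "VFPSDEFDASISQVNEKINQSLAFIRKSDELLHNV"
T112_FIRST_RESIDUE = 482  # F1 protein numbering of the first T112 residue


def alanine_scan(sequence, window=3):
    """Yield (first_pos, last_pos, mutant) for consecutive, non-overlapping
    windows; positions are 1-based within the peptide."""
    for offset in range(0, len(sequence), window):
        end = min(offset + window, len(sequence))
        mutant = sequence[:offset] + "A" * (end - offset) + sequence[end:]
        yield offset + 1, end, mutant


scan = list(alanine_scan(T112))
for first, last, mutant in scan:
    # Convert peptide positions to F1 protein residue numbers.
    f1_first = first + T112_FIRST_RESIDUE - 1
    f1_last = last + T112_FIRST_RESIDUE - 1
    print(f"{first:2d}-{last:2d}  (F1 {f1_first}-{f1_last})  {mutant}")
```

Under this assumption the walk produces twelve windows, matching the twelve peptides T800 through T811, and the third, fifth, eighth and eleventh windows fall on T112 residues 7-9, 13-15, 22-24 and 31-33, i.e., the regions reported above for T802, T804, T807 and T810.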

The peptides T1669-T1673 and T1680 through T1684
were then synthesized, each of which contains a single
alanine substitution at one of the above-listed amino
acid residue positions of T112. The binding affinity
of these peptides for their DP107-like target can also
be determined by means of the same routine screening
assays, thereby identifying individual amino acid
residues which affect binding affinity of T112.

In addition, a further novel peptide,
referred to as T786, was generated by modifying
various amino acid residues in the T112 sequence which
were identified, using standard principles of protein
design, as affecting properties such as binding
affinity, solubility and biological stability.
Specifically, the following amino acid residue
substitutions were made: F2 to Y, S21 to A, F24 to Y
and S28 to A, wherein the numerals indicate the
amino acid residue position in T112. The resultant
peptide, which is referred to herein as T786, thus had
the amino acid sequence:

VYPSDEFDASISQVNEKINQALAYIRKADELLHNV

The binding affinity of this novel peptide for
the DP107 target was found to be 19 nM, i.e.,
approximately ten-fold less than the binding affinity
of T112.
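
The four substitutions can be applied programmatically as a consistency check on the T786 sequence. The following is a minimal sketch in which the function and variable names are hypothetical; the fold change simply compares the 19 nM value with the 2 nM value reported earlier in this Section for T112.

```python
T112 = "VFPSDEFDASISQVNEKINQSLAFIRKSDELLHNV"

# (original residue, 1-based position in T112, replacement), as listed in the text
SUBSTITUTIONS = [("F", 2, "Y"), ("S", 21, "A"), ("F", 24, "Y"), ("S", 28, "A")]


def apply_substitutions(sequence, substitutions):
    """Apply point substitutions, checking each expected original residue."""
    residues = list(sequence)
    for original, position, replacement in substitutions:
        found = residues[position - 1]
        if found != original:
            raise ValueError(f"expected {original} at {position}, found {found}")
        residues[position - 1] = replacement
    return "".join(residues)


t786 = apply_substitutions(T112, SUBSTITUTIONS)
fold_change = 19 / 2  # reported values: 19 nM for T786 versus 2 nM for T112
print(t786, fold_change)
```

The resulting sequence agrees with the T786 sequence given above, and the ratio of 9.5 agrees with the approximately ten-fold reduction stated in the text.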

The data demonstrate that peptides having a
reduced binding affinity for a DP107 target (i.e., for
an HR1 domain) may be readily found by modifying a
DP178 peptide such as T112, e.g., by means of the
routine techniques and assays described herein.
Further, the techniques and assays identify key amino
acid residues which may be used to construct and
identify other reduced affinity peptides.

31. EXAMPLE: IDENTIFICATION OF HIV DP107/DP178
ANALOGS WITH REDUCED BINDING
AFFINITY

In the example presented herein, peptides derived
from DP178, which is also referred to as T20, are
described and tested for binding affinity to the DP107
domain of the HIV gp41. Particular peptides are
identified that have a reduced binding affinity for
their DP107 target, and key amino acid residues are
identified that confer high binding affinity to the
native peptide (i.e., to T20). Such peptides are
useful, e.g., in screening assays such as those
described above in Section 5.6.1 to identify compounds
which inhibit or disrupt the interaction between DP107
and DP178.

Specifically, the peptides identified as T813 and
T868 through T878 in FIG. 53 were synthesized to
identify particular amino acids in T20 (DP178) that
contribute to a greater part of that peptide's binding
affinity. Each of the peptides synthesized had a
change of three consecutive amino acid residues in the
T20 sequence to three alanine residues. The antiviral
activity of each peptide was assayed in cell fusion
assays as described in Section 6.1.3, above. The
binding affinities of the peptides were also measured
in a competitive binding assay described in Section
5.6.1 above, wherein each peptide's ability to disrupt
the binding of either biotin (T83) or fluorescein
(T1342) labeled DP178 (T20) to the M41Δ178 fusion
protein described in Section 8, above, was measured.
The binding affinity of each peptide to the peptide
referred to as T764
(GSTMGARSMTLTVQARQLLSGIVQQNNLLRAIEAQQH) was also measured
using circular dichroism to monitor the amount of
secondary structure (i.e., helicity) adopted by the
peptides. T764 is a peptide which represents the
DP107 target domain of DP178 (T20).

The results are provided in FIG. 54. In
particular, the peptides T813, T878, T874-T876 and
T871 have significantly reduced affinity for the DP107
region, suggesting that the regions corresponding to the
substituted amino acid residues in those peptides
contribute significantly to the high binding affinity
of T20. The peptides T1627-T1632, T1650-T1653 and
T1656-T1665 were then synthesized. Each of these
peptides contains a single alanine substitution at one
of the amino acid residues in one of the regions
identified as contributing significantly to the high
binding affinity of T20. Identical assays which
measured the binding affinity of these peptides
identified four essential residues (I646, Q652, Q653 and
N656, with the numerals indicating the
residue position in the HIV-1 LAI gp41 amino acid
sequence) in which alanine substitution completely
prevented binding to the DP107 domain, as well as five
residues (L641, I642, L645, E657 and L663, with the
numerals indicating the residue position in the
HIV-1 LAI gp41 amino acid sequence) in which
alanine substitution reduced the binding affinity
but did not actually block binding to the DP107
domain.
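
The residue labels above can be cross-checked against the DP178 sequence with a short sketch. The sequence string below does not appear in this Section; it is the sequence commonly reported for DP178 (T20), spanning gp41 residues 638 to 673, and should be verified against SEQ ID NO:1. The third of the five affinity-reducing residues is read here as position 645. All function and variable names are hypothetical.

```python
# Assumption: DP178 (T20) sequence as commonly reported; verify against SEQ ID NO:1.
DP178 = "YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF"
DP178_FIRST_RESIDUE = 638  # gp41 residues 638 to 673

ESSENTIAL = [646, 652, 653, 656]          # alanine substitution abolished binding
CONTRIBUTING = [641, 642, 645, 657, 663]  # alanine substitution reduced binding


def residue_at(position):
    """Return the one-letter residue at a gp41 position within DP178."""
    return DP178[position - DP178_FIRST_RESIDUE]


def alanine_mutant(position):
    """Return DP178 with the residue at the given gp41 position set to alanine."""
    index = position - DP178_FIRST_RESIDUE
    return DP178[:index] + "A" + DP178[index + 1:]


essential_labels = [f"{residue_at(p)}{p}" for p in ESSENTIAL]
contributing_labels = [f"{residue_at(p)}{p}" for p in CONTRIBUTING]
print(essential_labels, contributing_labels)
```

With the assumed sequence, the positions map to isoleucine, glutamine, glutamine and asparagine for the four essential residues, and to leucine, isoleucine, leucine, glutamic acid and leucine for the five affinity-reducing residues. The helper `alanine_mutant` shows how a single-substitution peptide of the kind described could be written out.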

The data demonstrate that peptides having a
reduced binding affinity for a DP107 target (i.e., for
an HR1 domain) may be readily found by modifying a
DP178 peptide such as T20, e.g., by means of the
routine techniques and assays described herein.
Further, the techniques and assays identify key amino
acid residues which may be used to construct and
identify other reduced affinity peptides.



The present invention is not to be limited in
scope by the specific embodiments described which are
intended as single illustrations of individual aspects
of the invention, and functionally equivalent methods
and components are within the scope of the invention.
Indeed, various modifications of the invention, in
addition to those shown and described herein will
become apparent to those skilled in the art from the
foregoing description and accompanying drawings. Such
modifications are intended to fall within the scope of
the appended claims.

WHAT IS CLAIMED IS:

1. A method for identifying a compound that
inhibits the formation of or disrupts a DP107/DP178
complex comprising:

(a) preparing, both in the presence and in the
absence of a test compound, a reaction
mixture containing a DP107 peptide and a
DP178 peptide under conditions and for a
time sufficient to allow formation of a
DP107/DP178 complex; and

(b) detecting the formation of a DP107/DP178
complex both in the presence and in the
absence of the test compound,

wherein the formation of a DP107/DP178 complex in the
absence, but not in the presence, of the test compound
indicates that the compound inhibits the formation of
or disrupts a DP107/DP178 complex.

2. The method of Claim 1, wherein the DP107
peptide or the DP178 peptide is a modified DP107 or
DP178 peptide.

3. The method of Claim 2, wherein the modified
DP107 or DP178 peptide has a reduced binding affinity.
